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L5 ANSWER 1 OF 6 MEDLINE on STN 

AN 1999432178 MEDLINE 
DN PubMed ID: 10500122 

TI Reverse biochemistry: use of macromolecular protease inhibitors to dissect 
complex biological processes and identify a membrane-type serine protease 
in epithelial cancer and normal tissue. 

AU Takeuchi T; Shuman M A; Craik C S 

CS Department of Pharmaceutical Chemistry, University of California, San 

Francisco, CA 94143, USA. 
NC CA71097 (NCI) 

CA72006 (NCI) 

SO Proceedings of the National Academy of Sciences of the United States of 

America, (1999 Sep 28) 96 (20) 11054-61. 

Journal code: 7505876. ISSN: 0027-8424. 
CY United States 

DT Journal; Article; (JOURNAL ARTICLE) 

LA English 

FS Priority Journals 

OS GENBANK- AF 133086 

EM 199910 

ED Entered STN: 19991101 

Last Updated on STN: 20000303 
Entered Medline: 19991021 

AB Serine proteases of the chymotrypsin fold are of great interest because 
they provide detailed understanding of their enzymatic properties and 
their proposed role in a number of physiological and pathological 
processes. We have been developing the macromolecular inhibitor ecotin to 
be a "fold-specific" inhibitor that is selective for members of the 
chymotrypsin- fold class of proteases. Inhibition of protease activity 
through the use of wild-type and engineered ecotins results in inhibition 
of rat prostate differentiation and retardation of the growth of human 
PC-3 prostatic cancer tumors. In an effort to identify the proteases that 
may be involved in these processes, reverse transcription-PCR with PC-3 
poly (A) + mRNA was performed by using degenerate oligonucleotide primers. 
These primers were designed by using conserved protein sequences unique to 
chymotrypsin- fold serine proteases. Five proteases were identified: 
urokinase-type plasminogen activator, factor XII, protein C, trypsinogen 
IV, and a protease that we refer to as membrane -type 
serine protease 1 (MT-SPl 

) . The cloning and characterization of the MT-SPl 

cDNA shows that it encodes a mosaic protein that contains a transmembrane 
signal anchor, two CUB domains, four LDLR repeats, and a serine protease 
domain. Northern blotting shows broad expression of MT- 
SPl in a variety of epithelial tissues with high levels of 
expression in the human gastrointestinal tract and the prostate. A 
His -tagged fusion of the MT-SPl protease domain was 

expressed in Escherichia coli, purified, and autoactivated. Ecotin and 
variant ecotins are subnanomolar inhibitors of the MT- 
SPl activated protease domain, suggesting a possible role for 
MT-SPl in prostate differentiation and the growth of 
prostatic carcinomas. 

^ L5 ANSWER 2 OF 6 MEDLINE on STN 

AN 1999303582 MEDLINE 
DN PubMed ID: 10373425 

TI Purification and characterization of a complex containing 

matriptase and a Kunitz-type serine protease inhibitor from human 
milk. 

AU Lin C Y; Anders J; Johnson M; Dickson R B 

CS Lombardi Cancer Center, Georgetown University Medical Center, Washington, 



D.C. 20007, USA . 
NC 1P50CA58158 (NCI) 

P30-CA51005 (NCI) 

P30-CA51008 (NCI) 
SO Journal of biological chemistry, (1999 Jun 25) 274 (26) 

18237-42. 

Journal code: 2985121R. ISSN: 0021-9258. 
CY United States 

DT Journal; Article; (JOURNAL ARTICLE) 

LA English 

FS Priority Journals 

EM 199907 

ED Entered STN: 19990727 

Last Updated on STN: 20000303 
Entered Medline: 19990715 

AB Matriptase, a trypsin-like serine protease with two potential 

regulatory modules (low density lipoprotein receptor and complement Clr/s 
domains), was initially purified from T-47D breast cancer cells. Given 
its plasma membrane localization, extracellular matrix-degrading activity, 
and expression by breast cancer cells, this protease may be involved in 
multiple aspects of breast tumor progression, including cancer invasion. 
In breast cancer cells, matriptase was detected mainly as an 
uncomplexed form; however, low levels of matriptase were 
detected in complexes. In striking contrast, only the complexed 
matriptase was detected in human milk. The complexed 
matriptase has now been purified. Amino acid sequences obtained 
from the matriptase-associated proteins reveal that they are 
fragments of a Kunitz-type serine protease inhibitor that 'was previously 
reported to be an inhibitor of the hepatocyte growth factor activator. In 
addition, matriptase and its complexes were detected in 

milk-derived, SV40 T-antigen-immortalized mammary luminal epithelial cell 
lines, but not in human foreskin fibroblasts or in HT-1080 fibrosarcoma 
cells. These results suggest that the milk-derived matriptase 
complexes are likely to be produced by the epithelial components of the 
lactating mammary gland in vivo and that the activity and function of 
matriptase may be differentially regulated by its cognate 
inhibitor, comparing breast cancer with the lactating mammary gland. 
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DN PubMed ID: 10373424 

TI Molecular cloning of cDNA for matriptase, a matrix-degrading 

serine protease with trypsin-like activity. 
AU Lin C Y; Anders J; Johnson M; Sang Q A; Dickson R B 

CS Lombardi Cancer Center, Georgetown University Medical Center, Washington, 

D. C. 20007, USA. 
NC 1P50CA58158 (NCI) 

P30-CA51005 (NCI) 

P30-CA51008 (NCI) 

SO Journal of biological chemistry, (1999 Jun 25) 274 (26) 18231-6. 

Journal code: 2985121R. ISSN: 0021-9258. 
CY United States 

DT Journal; Article; (JOURNAL ARTICLE) 

LA English 

FS Priority Journals 

OS GENBANK-AF1 18224 

EM 199907 

ED Entered STN: 19990727 

Last Updated on STN: 20000303 

Entered Medline: 19990715 
AB A major protease from human breast cancer cells was previously detected by 



gelatin zymography and proposed to play a role in breast cancer invasion 
and metastasis. To structurally characterize the enzyme, we isolated a 
cDNA encoding the protease. Analysis of the cDNA reveals three sequence 
motifs: a carboxyl -terminal region with similarity to the trypsin-like 
serine proteases, four tandem cysteine-rich repeats homologous to the low 
density lipoprotein receptor, and two copies of tandem repeats originally 
found in the complement subcomponents Clr and Cls. By comparison with 
other serine proteases, the active-site triad was identified as His-484, 
Asp-539, and Ser-633. The protease contains a characteristic 
Arg-Val-Val-Gly-Gly motif that may serve as a proteolytic activation site. 
The bottom of the substrate specificity pocket was identified to be 
Asp-627 by comparison with other trypsin-like serine proteases. In 
addition, this protease exhibits trypsin-like activity as defined by 
cleavage of synthetic substrates with Arg or Lys'as the PI site. Thus, 
the protease is a mosaic protein with broad spectrum cleavage activity and 
two potential regulatory modules. Given its ability to degrade 
extracellular matrix and its trypsin-like activity, the name 
matriptase is proposed for the protease. 

L5 ANSWER 4 OF 6 CAPLUS COPYRIGHT 2005 ACS on STN 
AN 1999:617620 CAPLUS 

TI Structural basis of the interaction between matriptase and its 

cognate Kunitz-type inhibitor. 
AU Lin, Chen-Yong; Enyedy, Istvan J.; Wang, Shaomeng; Dickson, Robert D. 
CS Lombardi Cancer Center, Georgetown University, Washington, DC, 2 0007, USA 
SO Book of Abstracts, 218th ACS National Meeting, New Orleans, Aug. 22-26 ( 

1999) , MEDI-246 Publisher: American Chemical Society, Washington, 

D. C. 

CODEN: 67ZJA5 
DT Conference; Meeting Abstract 
LA English 

AB Matriptase (MTP) is a trypsin-like serine protease proposed to 

play a role in breast cancer invasion and metastasis. MTP is found in 
human milk as a complex with a Kunitz-type inhibitor (KSPI) , while in 
human breast cancer cells MTP and KSPI are mainly in uncomplexed form. 
Non-reduced/reduced diagonal gel electrophoresis of the uncomplexed and 
complexed MTP revealed that the complexed MTP is a two-chain protease; in 
contrast, the uncomplexed MTP is a single-chain protease. The cleavage 
site was identified to be the Arg residue that is located in the conserved 
activation motif (Arg-Val -Val-Gly-Gly) . These results suggest that 
conversion of the single-chain MTP to two-chain MTP is required for the 
interaction between MTP and the KSPI. Computer modeling revealed that the 
KSPI Kunitz domain I, but not domain II is the functionally inhibitory 
domain that interacts with MTP. This is consistent with the observation 
that the 4 0-kDa fragment of the KSPI that exhibits binding affinity to MTP 
contains the Kunitz domain I, but does not contain the intact Kunitz 
domain II. Thus, the serine protease domain in active configuration of 
MTP and the Kunitz domain I of the KSPI are required for their 
interaction. 
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TI TADG-15: an extracellular serine protease 

overexpressed in breast and ovarian carcinomas 
IN O'Brien, Timothy J.; Tanimoto, Hirotoshi 
PA The Board of Trustees of the University of Arkansas, USA 
SO PCT Int. Appl., 71 pp. 
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DT Patent 
LA English 



FAN.CNT 35 

PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



PI 


WO 


9942120 






Al 


19990826 


WO 


1999- 


US3436 




19990218 <- 






W: AU, 


CA, 


JP 




















RW: AT, 


BE, 


CH, 


CY, 


DE, DK, ES, 


FI, FR, GB, 


GR, IE, 


IT, 


LU, MC, NL, 






PT, 


SE 




















US 


5972616 






A 


19991026 


US 


1998- 


27337 




19980220 <- 




CA 


2324993 






AA 


19990826 


CA 


1999- 


2324993 




19990218 <- 




AU 


9928692 






Al 


19990906 


AU 


1999- 


28692 




19990218 <- 




AU 


750932 






B2 


20020801 














EP 


1056466 






Al 


20001206 


EP 


1999- 


909502 




19990218 




EP 


1056466 






Bl 


20040121 
















R: AT, 


BE, 


CH, 


DE, 


DK, ES, FR, 


GB, GR, IT, 


LI, LU, 


NL, 


SE, MC, PT, 






IE, 


FI 




















AT 


258060 






E 


20040215 


AT 


1999- 


909502 




19990218 




PT 


1056466 






T 


20040630 


PT 


1999- 


909502 




19990218 




ES 


2215376 






T3 


20041001 


ES 


1999- 


909502 




19990218 




US 


6649741 






Bl 


20031118 


US 


2000- 


654600 




20000901 




us 


2004086910 




Al 


20040506 


US 


2003- 


600187 




20030620 


PRAI 


us 


1998-27337 




A ■ 


19980220 














WO 


1999-US3436 




W 


19990218 














us 


1999-421213 




A3 


19991020 














us 


2000-654600 




A3 


20000901 













AB The invention provides a cDNA encoding a novel human extracellular serine 
protease termed Tumor Antigen Derived 
Gene-15 (TADG-15) . The cDNA 

sequence as well as the corresponding deduced amino acid sequence of human 
TADG-15 are disclosed. Also disclosed is a vector 
capable of expressing the human TADG-15 when 

transfected into a foreign host cell. In addition, a method (nucleic acid 
hybridization) for detecting the level of TADG-14 gene mRNA is disclosed. 
The TADG-15 serine protease has been found to be 

overexpressed in breast and ovarian carcinomas. PCR primers specific for 
the TADG-15 gene were constructed and used to show 
expression of TADG-15 mRNA in ovarian carcinomas. Due 
to the extracellular nature of the TADG-15 serine 

protease, it may be possible to exploit its expression as a diagnostic 
tool for ovarian cancer. 
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L5 ANSWER 6 OF 6 BIOSIS COPYRIGHT (c) 2005 The Thomson Corporation. on STN 
AN 2000:294636 BIOSIS 
DN PREV200000294636 

TI TADG-15: an extracellular serine protease 

overexpressed in breast and ovarian carcinomas. 
AU O'Brien, Timothy J. [Inventor, Reprint author] ; Tanimoto, Hirotoshi 

[Inventor] 
CS Little Rock, AR, USA 

ASSIGNEE: The Board of Trustees of the University of Arkansas, Little 

Rock, AR, USA 
PI US 5972616- October 26, 1999 

SO Official Gazette of the United States Patent and Trademark Office Patents, 

(Oct. 26, 1999) Vol. 1227, No. 4. e-file. 

CODEN: 0GUPE7. ISSN: 0098-1133. 
DT Patent 
LA English 

ED Entered STN: 6 Jul 2 000 

Last Updated on STN: 7 Jan 2002 
AB The present invention provides a DNA encoding a TADG-15 



protein selected from the group consisting of: (a) isolated DNA which 

encodes a TADG-15 protein; (b) isolated DNA which 

hybridizes to isolated DNA of (a) above and which encodes a TADG 

-15 protein; and (c) isolated DNA differing from the isolated 

DNAs of (a) and (b) above in codon sequence due to the degeneracy of the 

genetic code, and which encodes a TADG-15 protein. 

Also provided is a vector capable of expressing the DNA of the present 
invention adapted for expression in a recombinant cell and regulatory 
elements necessary for expression of the DNA in the cell. 



L13 ANSWER 1 OF 4 MEDLINE on STN 

AN 94080664 MEDLINE 
DN PubMed ID: 8258142 

TI An ovalbumin pept ide-specif ic cytotoxic T cell clone with antigen 

self -presentation capacity uses two distinct mechanisms to kill target 
cells . 

AU Dick T; Reichmann G; Ebnet K; Simon M M; Dienes H P; Echternacher B; 

Krammer P H; Reske-Kunz A B 
CS Inst i tut fur Immunologic, Johannes Gutenberg Universitat, Mainz, Germany. 
SO Cellular immunology, (1993 Dec) 152 (2) 333-47. 

Journal code: 1246405. ISSN: 0008-8749. 
CY United States 

DT Journal; Article; (JOURNAL ARTICLE) 

LA English 

FS Priority • Journals 

EM 199401 

ED Entered STN: 1994 02 03 

Last Updated on STN: 20000303 
Entered Medline: 19940114 

AB Cloned 10BK.1 T cells with specificity for the ovalbumin peptide 

OVA257-264 are representative of a novel cell type within the CD8+ subset 
of T cells. In the presence and in the absence of added antigen 
presenting cells these T cells react toward antigen (Ag) by proliferation 
and lymphokine production. These data suggest self -presentation of the Ag 
by 10BK.1 cells. Here we present evidence that 10BK.1 cells exhibit 
cytotoxic activity that involves two different cytotoxic effector 
mechanisms, (i) One mechanism is fast killing activity, apparent within 4 
hr. Constitutive mouse T cell -specif ic proteinase-1 (MTSP- 
1) activity, constitutive expression of MTSP-1 

RNA, increased by Ag challenge, and Ag-inducible perforin RNA expression 
were observed. Electron microscopic dense granules of the CTL were 
oriented toward Ag-pulsed target cells. The fast form of cytotoxicity was 
triggered by Ag recognition and by contact with IL-2. (ii) The other 
mechanism is slow cytolytic activity, manifested within 2 days. This 
activity was contained in the supernatant of 10BK.1 cells after Ag 
activation. It was inhibited by monoclonal anti-TNF antibodies and 
therefore presumably represents TNF alpha/beta. Cytotoxic T cells capable 
of antigen self -presentation may be responsible for tissue damage during 
bacterial and viral infections. 

L13 ANSWER 2 OF 4 MEDLINE on STN 

AN 92223391 MEDLINE 
DN PubMed ID: 1562736 

TI A T-cell -related proteinase expressed by T-lymphoma cells activates their 

endogenous pro-urokinase . 
AU Brunner G; Vettel U; Jobstmann S; Kramer M D; Schirrmacher V 
CS German Cancer Research Center, Institute of Immunology and Genetics, 

Heidelberg . 
SO Blood, (1992 Apr 15) 79 (8) 2099-106. 

Journal code: 7603509. ISSN: 0006-4971. 
CY United States 

DT Journal; Article; (JOURNAL ARTICLE) 
LA English 

FS Abridged Index Medicus Journals; Priority Journals 
EM 199205 

ED Entered STN: 19920607 

Last Updated on STN: 20000303 

Entered Medline: 19920515 
AB In this report, we investigated the expression and activation of 

proteolytic enzymes by mouse T-lymphoma cell lines of differing metastatic 



potential. In contrast to the low metastatic Eb line, the metastatic 
variants ESb and ESb-MP secreted urokinase-type plasminogen activator 
(u-PA) , which was present in the culture supernatant predominantly in the 
active form ( ESb, 96%; ESb-MP , 80%) . All three T-lymphoma variants 
expressed, a mainly cell surface-associated proteinase, which proved to be 
immunologically and enzymatically related to the murine T-cell-associated 
serine proteinase-1 (MTSP-1) . Intact lymphoma cells 

were able to activate the recombinant human proenzyme of u-PA (pro-u-PA) 
by a plasmin- independent mechanism, because plasmin contamination of the 
cells was not detectable. When ESb-MP cells were cultured in the presence 
of inhibitors of MTSP-1, such as antithrombin III, 

Pro-Phe-Arg-chloromethylketone, or aprotinin, the ratio of endogenously 
activated murine u-PA to inactive pro-u-PA in conditioned medium was 
significantly reduced (from 80% to 15%) . The most potent inhibitor, 
antithrombin, did not inhibit plasmin-catalyzed pro-u-PA activation. 
These results suggest a novel autocrine mechanism of plasmin- independent 
pro-u-PA activation for metastatic T lymphomas by the production of an 
MTSP-l-related proteinase. The ability to initiate the 
proteolytic cascade of plasminogen activation in the absence of plasmin 
might contribute to the metastatic behavior of these cells observed in 
vivo. 

L13 ANSWER 3 OF 4 BIOSIS COPYRIGHT (c) 2005 The Thomson Corporation. on STN 
AN 1999:97562 BIOSIS 
DN PREV199900097562 

TI Cloning and expression of TADG-15, a novel serine 

protease expressed in ovarian cancer. 
AU O'Brien, T. J.; Underwood, L. J.; Tanimoto, H. 
CS Univ. Arkansas Med. Sci., Little Rock, AR 72205, USA 
SO Tumor Biology, (Aug., 1998) Vol. 19, No. SUPPL. 2, pp. 33. print. 

Meeting Info.: 26th Meeting of the International Society for 

Oncodevelopmental Biology and Medicine. Umea, Sweden. August 30-September 

4, 1998. 

ISSN: 1010-4283. 
DT Conference ; (Meet ing) 

Conference; Abstract; (Meeting Abstract) 
LA English 

ED Entered STN: 4 Mar 1999 

Last Updated on STN: 4 Mar 1999 
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AN 1998:198282 BIOSIS 
DN PREV199800198282 

TI Cloning and expression of TADG-15, a novel serine 

protease expressed in ovarian cancer. 
AU Tanimoto, H. ; Underwood, L. J.; Clarke, Y. ; O'Brien, T. J. 
CS Univ. Arkansas Med. Sci., Litte Rock, AR 72205, USA 
SO Proceedings of the American Association for Cancer Research Annual 

Meeting, (March, 1998) Vol. 39, pp. 648. print. 

Meeting Info.: 89th Annual Meeting of the American Association for Cancer 

Research. New Orleans, Louisiana, USA. March 28-April 1, 1998. American 

Association for Cancer Research. 

ISSN: 0197-016X. 
DT Conference ; (Meet ing) 

Conference; Abstract; (Meeting Abstract) 
LA English 

ED Entered STN: 4 May 1998 

Last Updated on STN : 4 May 1998 
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TI Potent and selective inhibition of membrane- type 
serine protease 1 by human single-chain * 
antibodies • 

AU Sun Jeonghoon; Pons Jaume; Craik Charles S 

CS Department of Pharmaceutical Chemistry, University of California, San 

Francisco, 513 Parnassus, San Francisco, California 94143, USA. 
NC CA72006 (NCI) 

SO Biochemistry, (2003 Feb 4) 42 (4) 892-900. 

Journal code: 0370623. ISSN: 0006-2960. 
CY United States 

DT Journal; Article; (JOURNAL ARTICLE) 

LA English 

FS Priority Journals 

EM 200303 

ED Entered STN : 20030129 

Last Updated on STN: 20030328 
Entered Medline: 20030327 

AB Specific human antibodies targeting proteases expressed on 

cancer cells can be valuable reagents for diagnosis, prognosis, and 

therapy of cancer. To this end, a phage-displayed antibody 

library was screened against a cancer-associated serine protease, 

MT-SP1. A protein inhibitor of serine proteases that 

binds to a defined surface of MT-SP1 was used in an 

affinity-based washing procedure. Six antibodies were selected 

on the basis of their ELISA profiles and ability to serve as useful 

immunological reagents. The apparent K(i) , indicative of the potency of 

the antibodies at inhibiting human MT-SP1 

activity, ranged from 50 pM to 129 nM. Two of the antibodies 

had approximately 800-fold and 1500-fold selectivity when tested against 

the most homologous serine protease family member, mouse MT- 

SP1, that exhibits 86.6% sequence identity. Surface plasmon 

resonance was used as an independent means of determining the binding 

constants of the six antibodies. Association rates were as high 

as 1.15 x 10(7) s (-) (1) M{-)(1), and dissociation rates were as low as 3 . 8 

x 10(-) (4) s(-) (1) . One antibody was shown to detect denatured 

MT-SP1 with no cross reactivity to other family members 

in HeLa or PC3 cells. Another antibody recognized the enzyme in 

human prostate tissue samples for immunohistochemistry analysis. The mode 

of binding among the six antibodies and the protease was 

analyzed by competition ELISA using three distinctly different inhibitors 
that mapped the enzyme surface. These antibodies constitute a 
new class of highly selective protease inhibitors that can be used to 
dissect the biological roles of proteolytic enzymes as well as to develop 
diagnostic and therapeutic reagents. 
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TI Immunoassay for activated matriptase and its use in 

cancer diagnosis and treatment 
IN Dickson, Robert B.; Lin, Chen-Yong; Benaud, Christelle; Oberst, Michael 
PA USA 

SO U.S. Pat. Appl. Publ., 41 pp. 

CODEN: USXXCO 
DT Patent 
LA English 



FAN.CNT 1 

PATENT NO. KIND DATE APPLICATION NO. DATE 



PI US 2003170245 Al 20030911 US 2002-87987 20020305 

PRAI US 2002-87987 20020305 

AB The present invention provides an in vitro method of diagnosing the 
presence of a pre-malignant lesion, a malignancy, or other pathol . 
condition, in a subject, which is characterized by the presence of 
activated matriptase including the steps of: (A) obtaining a 
biol . sample from a subject that is to be tested for a pre-malignant 
lesion, a malignancy, or other pathol. condition; (B) exposing the biol. 
sample to a detectable agent which recognizes and binds to activated 
matriptase; and (C) determining whether said detectable agent is bound 
to the biol. sample. Preferably, the detectable agent is an 
antibody which specifically binds to activated matriptase 

More preferably the antibody is selected from M69 and M123. 
The invention also provides a method of treating malignancies, 
pre-malignant conditions, and pathol. conditions in a subject which are 
characterized by the activated form of matriptase including 
administering a therapeutically effective amount of an agent capable of 
blocking the activity of active matriptase. 
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TI Sequences of human matriptase- like serine protease and 

diagnostic and therapeutic uses thereof 
IN Xiao, Yonghong 

PA Bayer Aktiengesellschaf t , Germany 
SO PCT Int. Appl., 14 0 pp. 
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AB Reagents which regulate human matriptase- like serine protease 

activity and reagents which bind to human matriptase-like serine 
protease gene products can be used to regulate extracellular matrix 
degradation Such regulation is particularly useful for treating cancer, 
including metastasis of malignant cells, chronic obstructive pulmonary 
disease, disorders of the peripheral or central nervous system, and 
cardiovascular disorders. 
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AB A method is provided for inhibiting carcinoma progression in which 
matriptase plays a role in a subject in need of such inhibition. 
The method includes administering to a subject an effective amount of a 
compound comprising two pos . charged groups, which are the same or 
different. The groups are linked by a chemical group having a length of 5-30 
A, and preferably 15-24 A. Diagnostic methods based on matriptase 
action and therapeutic methods involving inhibition of matriptase 
activity are provided. 
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AB The present invention relates to the compns . , methods, and applications of 
a novel approach to selective cellular targeting. The purpose of this 
invention is to enable the selective delivery and/or selective activation 
of effector mols. to target cells for diagnostic or therapeutic purposes. 
The present invention relates to multi-functional prodrugs or targeting 
vehicles wherein each functionality is capable of enhancing targeting 
selectivity, affinity, intracellular transport, activation or 
detoxification. The present invention also relates to ultralow dose, 
multiple target, multiple drug chemotherapy and targeted immunotherapy for 
cancer treatment . 
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AB The present invention provides cDNA encoding a TADG-15 
(tumor antigen -derived gene 

15) serine protease from human that is overexpressed in ovarian 
tumors. TADG-15 appears to be a transmembrane 
multidomain serine protease. The TADG-15 cDNA is 3147 

base pairs long encoding for a 855 amino acid protein. Also provided is a 
vector capable of expressing the DNA of the present invention adapted for 
expression in a recombinant cell and regulatory elements necessary for 
expression of the DNA in the cell. The present invention further provides 
for methods of inhibiting TADG-15 expression and/or 
protease activity, methods of detecting TADG-15 mRNA 
and/or protein and methods of screening for TADG-15 

inhibitors. Addnl . , the present invention provides for cell -specif ic 
targeting via TADG-15 and methods of vaccinating an 
individual against TADG-15. The methods described are 

useful in the diagnosis, treatment and prevention of cancer, particularly 
breast and ovarian cancer. 
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AB This invention provides cDNA and encoded amino acid sequences of a novel 
membrane -type serine protease (designated MT-SP1) 

elevated expression of which is associated with cancer. In one embodiment, 
this invention provides a method obtaining a prognosis or of detecting or 
staging a cancer in an organism. The method involves providing a biol . 
sample from the organism and detecting the level of a membrane- 
type serine protease 1 (MT 

-SP1) in the sample, where an elevated level of the 

membrane-type serine protease, as compared to the level of the protease in 
a biol. sample from a normal healthy organism indicates the presence or 
stage of the cancer. 
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AB The invention is directed to a method of detecting a malignancy or a 

pre-malignant lesion in breast or other tissue, or a pathol . condition, by 
detecting the presence of single-chain or two-chain forms of 
matriptase in the tissue. The invention is further directed to a 
method of treating malignancies, which have the phenotype of 
matriptase production by administering a tumor formation inhibiting 
effective amount of a concentrate of Bowman-Birk ~ inhibitor (BBIC) , or other 
matriptase inhibitor. The invention also is directed to nucleic 
acids encoding a matriptase protein or fragments thereof, and 
their use for structure elucidation and modeling to identify other 
inhibitors of matriptase, as well as to methods of identifying 
matriptase modulating agents, including activators and inhibitors. 
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12249015 |dbi |BAB2037 6.1| 


prostamin [Homo sapiens] 
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10257390 | qb IAAG15395. 1 | 


serine protease TADG15 [Homo sap... 
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556373451 reflXP 508863.1| 


PREDICTED: similar to matripta. . . 
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suppression of tumorigenicity 1... 
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16758444|ref |NP 446087.1| 


suppression of tumorigenicity . . . 
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ST14 protein [Homo sapiens] 
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PREDICTED: similar to suppress... 
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Stl4-A-prov protein [Xenopus lae. . . 
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Unknown (protein for MGC: 81690) ... 
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ST14 protein [Homo sapiens] 
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unnamed protein product [Tetrao. . . 
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serine protease SNC19 [Homo sapi... 
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unnamed protein product [Tetrao. . . 
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qi | 33186804 | tpe | CAD67577 . 1 | TPA: type II transmembrane seri... 365 4e-99 El 

qi I 34869328 | ref | XP 221464.2| similar to hypothetical protei... 364 5e-99 B 

qi I 27369619 | ref |NP 766043.11 transmembrane serine protease . . . 363 2e-98 B 

qij 57109624 | ref |XP 545095.11 PREDICTED: similar to type II ... 360 8e-98 

qi I 50729728 | ref | XP 416635.11 PREDICTED: similar to type II ... 350 le-94 B 

qi | 556207961 ref | XP 516646.11 PREDICTED: similar to type II . . . 319 2e-85 

qi| 348669101 ref | XP 235768.2| similar to type II transmembra. . . 292 3e-77 B 

qi | 47217321 1 emb 1 CAG12529 . 1 1 unnamed protein product [Tetrao... 287 8e-76 

qi|55647901|ref |XP 524042. 1| PREDICTED: similar to transmem. . . 279 2e-73 

qi I 34862709 | ref [XP 234914.21 similar to polyserase 1A prote. . . 277 8e-73 B 

qi I 33667063 | ref |NP 892018.1| transmembrane serine protease . . . 277 le-72 13 

qi 150760956 | ref |XP 425880.11 PREDICTED: similar to transmem... 277 le-72 13 

qi | 33341912 | emb | CAD35759 . 1 | polyserase-IB protein [Homo sap... 277 le-72 13 

qi| 571022101 ref | XP 542189.11 PREDICTED: similar to transmem. . . 268 4e-70 

qi| 475758341 ref | NP 001001259.11 enteropeptidase, light chai . . . 245 4e-63 B 

qil 627699|pir | IA53663 enteropeptidase (EC 3.4.21.9) precurs... 245 4e-63 

qi 155656832 [ref 1 XP 514836.1| PREDICTED: enterokinase [Pan t. . . 244 le-62 

qi I 55655639 | ref |XP 531395.11 PREDICTED: enterokinase [Pan t. . . 244 le-62 

qi 1 1589367 |prfl [2211228A enteropeptidase 242 3e-62 

qi I 27806097 | ref |NP 776864.1| protease, serine, 7 [enterokin. . . 242 4e-62 13 

gi | 57108850 | ref |XP 544824.1| PREDICTED: similar to enterope. . . 241 9e-62 

qi 1 4506151 1 ref |NP 002763.11 enterokinase precursor [Homo sa. . . 240 le-61 B 

qi 1 292449261 ref | NP 006578.2| corin [Homo sapiens] 237 le-60 B 

qi 1 48 84872 | qb | AAD31850 . 1 | corin [Homo sapiens] >gi | 13878447 .. . 237 le-60 B 

qi | 2 6332511 1 dbj | BAC29973 . 1 1 unnamed protein product [Mus mu. . . 236 2e-60 B 

qi| 7768763 | dbj | BAA95557.il enterokinase [Homo sapiens] 235 4e-60 

qi I 57095842 | ref 1XP 539291.11 PREDICTED: similar to airway t. . . 233 2e-59 

qi I 50729868 | ref |XP 425539.1| PREDICTED: similar to enterope... 233 2e-59 B 

qil 6679489|ref | NP 032967.1| protease, serine, 7 (enterokina. . . 232 4e-59 B 

qil570958501ref | XP 539295.1| PREDICTED: similar to Serine p. . . 227 le-57 

qi I 34876793 | ref |XP 223301.2| similar to hypothetical protei... 226 3e-57 B 

qi I 6648960 | qb | AAF21308 . 1 1 mosaic serine protease epithelias. . . 225 5e-57 B 

qi | 37183152 | qb | AAQ8937 6. 1 1 serine protease [Homo sapiens] >. . . 224 7e-57 B 
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qi | 6137097 | qb | AAF04328 . 1 | serine protease DESC1 [Homo sapiens] 224 7e-57 B 

qil 343282261 ref|NP 056590.21 transmembrane protease, serine... 224 le-56 El 

qi | 23512211 | qb | AAH38393 . 1 | Transmembrane protease, serine 2... 223 le-56 B 

qi | 9739019 | qbjAAF97 867 . 1 1 serine protease TMPRSS2 [Mus muse... 223 2e-5.6 B 

qil 517111541 ref|XP 487870.11 similar to airway trypsin-like. . . 222 4e-56 B 

qil 40254733|ref 1NP 569108.21 transmembrane protease, serine... 222 4e-56 El 

qil 164449621 dbj IBAB7 0683. 1| TMPRSS2 [Rattus norvegicus] 221 7e-56 B 

qil 51571899 | ref |NP 001003979.11 airway trypsin-like 3 [Ratt. . . 219 3e-55 B 

qi | 47550925 | ref |NP 999636.11 cortical granule serine protea. . . 219 3e-55 B 

qi | 556226961 ref |XP 517261.1| PREDICTED: similar to DESC1 pr. . . 218 8e-55 

qi | 296508891 qb | AAQ86772 . 1 | corin [Rattus norvegicus] >gi | 32 . . . 217 le-54 10 

qil 507299851 ref |XP 416737.11 PREDICTED: similar to transmem. . . 214 7e-54 13 

qi | 9757700 | dbj | BAB08217 . 1 | embryonic serine protease-2 [Xen. . . 214 9e-54 B 

qi | 8394057 | ref 1NP 058565.11 corin [Mus musculus] >gi | 751371 .. . 213 2e-53 B 

qi I 273703401 ref |NP 766468.11 type II transmembrane serine p... 212 3e-53 B 

qi | 55560104 | qb | AAV52922 . 1 | type II transmembrane serine pro... 212 3e-53 

qi | 47077010 | dbj IBAD18439 . 1 | unnamed protein product [Homo s... 212 3e-53 

qi| 46249844 | qb | AAH68636. 1| Unknown (protein for MGC:79882) ... 212 4e-53 

qi | 6435698 | pdb | 1EKB | B Chain B, The Serine Protease Domain O. . . 211 7e-53 El 

qi I 30089303 | dbj I BAC7588 6. 1 1 mannose-binding lectin . associat .. . 211 7e-53 

qi | 34859342 | ref | XP 215087.2| similar to hypothetical protei... 209 3e-52 El 

qi[507470791ref | XP 426340.11 PREDICTED: similar to Atrial n. . . 208 6e-52 B 

qi | 50746867 j ref | XP 426319.1| PREDICTED: similar to Serine p... 208 6e-52 B 

qi | 3008 9307 | dbj | BAC75888 . 1 | mannose-binding lectin associat... 208 6e-52 

qi | 50843972 | qb | AAT84164 . 1 | enterokinase light chain [Bos ta... 207 le-51 B 

qi | 17 981042 | qb | AAL50817 . 1 [ airway trypsin-like protease [Ra... 207 le-51 B 

qi | 56270244 | qb | AAH867 38 . 1 | Zgc: 101791 [Danio rerio] >gi|566... 206 3e-51 

qi I 34867472 | ref |XP 213668.21 similar to enteropeptidase [Ra... 205 4e-51 B 

qi I 12018322 | ref |NP 072152.11 adrenal secretory serine prote... 205 5e-51 B 

qi| 14091028 Iqb IAAK53559.il epitheliasin [Homo sapiens] 204 9e-51 B 

qil 2507 613|qb|AAC517 84. 1 1 serine protease [Homo sapiens] 204 le-50 B 

qi | 14602459 | ref |NP 005647.21 transmembrane protease, serine... 203 2e-50 B 

qi | 30353995 | qb | AAH518 39. 1 1 Transmembrane protease, serine 2... 202 3e-50 B 

qi I 51762028 | ref [XP 486010.11 RIKEN cDNA C330007D15 [Mus mus... 202 3e-50 B 

qi | 57095844 | ref [XP 539292.1| PREDICTED: similar to epiderma . . . 201 6e-50 

qi|21704106|ref |NP 663536.11 airway trypsin-like protease p... 200 2e-49 B 

qi I 33187750 | qb | AAP97729. 1 1 airway trypsin-like protease pre... 200 2e-49 B 

qi| 17 864976 |qb IAAL47139.il airway trypsin-like protease [Mu. . . 200 2e-49 B 

qi | 47211451 j emb | CAG12258 . 1 1 unnamed protein product [Tetrao. . . 199 3e-49 

qil 18141559 I ref |NP 542765.11 transmembrane protease, serine... 198 5e-49 B 

qi I 28194548 | qb 1 AA033581 . 1 1 transmembrane proteinase serine . . . 198 5e-49 

qi| 22797097 | emb | CAD22137.il type II transmembrane serine pr... 198 7e-49 B 

qi | 527 8 9219 | qb | AAH83024 . 1 | Unknown (protein for MGC:97879) ... 196 2e-48 

qi | 47225057 | emb | CAF97472 . 1 | unnamed protein product [Tetrao... 196 2e-48 

qi | 37182040 | qb 1 AAQ88823 . 1 1 ECHOS1 [Homo sapiens] >gi | 509606 .. . 194 7e-48 B 

qi| 31238311 | ref |XP 319745. 1[ ENSANGP00000006359 [Anopheles . . . 194 7e-48 B 

qi| 54639460 | qb | EAL28862.il GA18102-PA [Drosophila pseudoobs... 194 9e-48 

qi| 47585081 ref |NP 004253.11 airway trypsin-like protease [H. . . 194 le-47 B 
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qi 1 24651942 Iref |NP 610436.11 CG11824-PA [Drosophila melanog... 194 le-47 El 

qil 55622800 Iref |XP 526607.11 PREDICTED: airway trypsin-like .. . 193 2e-47 

qi 1 464094461 reflNP 997290.11 FLJ16046 protein [Homo sapiens... 193 . 2e-47 B 

qi| 546364571 qb | EAL25860.il GA11219-PA [Drosophila pseudoobs... 193 2e-47 

qi | 552353091 gb | EAL39005. 1 | ENSANGP00000028900 [Anopheles ga . . . 191 6e-47 

qi 1 13507652 1 reflNP 109634.11 transmembrane protease, serine... 191 6e-47 El 

qi|13878822|sp|Q9ER04|TMS5 MOUSE Transmembrane protease, se... 191 6e-47 19 

qi 1 12248777 | dbj IBAB2027 6. 1 | type 2 spinesin [Mus musculus] 191 6e-47 B 

qi 1 12248781 | dbj IBAB2027 8. 1 | type 1 spinesin [Mus musculus] 191 6e-47 19 

qi|13959319|sp|Q61129|CFAI MOUSE Complement factor I precur... 191 6e-47 19 

qi | 57088547 | ref|XP 547177.11 PREDICTED: similar to marapsin... 191 8e-47 

qi|51591911|ref |NP 001004020.11 airway trypsin-like 5 [Ratt... 191 le-46 19 

qi | 34419641 | ref | NP 899234.1| ovochymase 1 [Homo sapiens] >g. . . 191 le-46 B 

qi | 26325646 | dbj | BAC26577 . 1 | unnamed protein product [Mus mu... 191 le-46 19 

qi 1 13173471 1 ref |NP 076927.1| transmembrane protease, serine... 190 2e-46 El 

qi 1 1079140 |pir | IA47547 serine proteinease stubble-stubbloid. . . 190 2e-46 

qi I 17136448 | ref |NP 476709.11 CG4316-PA [Drosophila melanoga .. . 190 2e-46.0 

qi I 55667225 | ref | XP 520817.11 PREDICTED: similar to ovochyma... 190 2e-46 

qi |49481858| qb | AAT66641 . 1 | transmembrane protease serine 3 ... 190 2e-46 

qi| 23463311 | ref | NP 695223.11 adrenal mitochondrial protease... 190 2e-46 19 

qi | 22770864 | qb [ AAN06758 . 1 | adrenal mitochondrial protease s... 190 2e-46 El 

qi| 2142 91061 qb | AAM50272.il LD44584p [Drosophila melanogaster] 190 2e-46 

qi | 13162353 | ref |NP 077071.11 complement factor I [Rattus no... 189 2e-46 19 

qi | 2166967 9 1 dbj | BAC018 64. 1 1 complement factor I [Triakis sc.. 189 3e-46 

qi | 334167 84 | qb | AAH56000 . 1 | MGC68910 protein [Xenopus laevis] 189 4e-46 El 

qi | 49522964 | qb | AAH752 93. 1 | Transmembrane serine protease 9 ... 189 4e-46 19 

qi | 20072342 | qb | AAH2 6555 . 1 | Kallikrein B, plasma 1 [Mus muse... 189 4e-46 19 

qi I 32189423 | ref | NP 848845.11 RIKEN cDNA 4732406D01 gene [Mu. . . 188 5e-46 B 

qi| 135405351 ref 1 NP 110397.1| transmembrane protease, serine... 188 5e-46 B 

qi| 481304531 ref [XP 393317.11 similar to ENSANGP00000006359 . . . • 188 5e-46 19 

qi | 2 6331844 | dbj | BAC29652. 1 | unnamed protein product [Mus mu. . . 188 7e-46 19 

gi|57095848|ref | XP 539294.11 PREDICTED: similar to RIKEN cD. . . 188 7e-46 

qi| 6680267[ref | NP 032307.1| hepsin [Mus musculus] >gi|26060... 188 7e-46 19 

gi|28892897|ref | NP 795998.11 RIKEN cDNA 9930019B18 gene [Mu. . . 188 7e-46 19 

qi|30179800|splO35453|HEPS MOUSE Serine protease hepsin 188 7e-46 19 

qi [27695202 | qb IAAH41753.il MGC53615 protein [Xenopus laevis] 187 9e-46 19 

qi | 47228191 | emb | CAG07586. 1 | unnamed protein product [Tetrao. . . 187 9e-46 

qi [ 6680584 | ref | NP 032481.1| kallikrein B, plasma 1 [Mus mus... 187 le-45 19 

qi | 55728266 | emb | CAH90878 . 1 | hypothetical protein [Pongo pyg. . . 187 le-45 

qi|21312500|ref | NP 082342. 1[ coagulation factor XI [Mus mus... 187 2e-45 19 

qi| 50746923 | ref |XP 420678.11 PREDICTED: similar to Coagulat... 187 2e-45 19 

qi| 6009515 [ dbj | BAA84941.il epidermis specific serine protea. . . 187 2e-45 19 

qi [ 45708911 | qb | AAH67937.il Hypothetical protein MGC69547 [X... 186 2e-45 19 

qi | 50603930 | qb | AAH77417 . 1 | Klkbl-prov protein [Xenopus laevis] 186 3e-45 E3 

qi| 14042984|ref | NP 114435.1| mosaic serine protease [Homo s... 186 3e-45 19 

qi| 13429970 | dbj | BAB397 41.il membrane-type mosaic serine pro... 186 3e-45 19 

qi I 55391473 | qb| AAH85323.il Unknown (protein f or MGC : 103261 ).. . 186 3e-45 

qi I 38089812 [ ref |XP 150167.31 similar to BC010843 protein [M. . . 186 3e-45 19 



http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi 



1/13/05 





27503083 | qb | AAH42878 . 1| 


BC010843 protein [Mus musculus] 


186 


3e- 


45 


El 


qi 


34877566|ref |XP 343616.11 


similar to polyserase 1A prote... 


185 


4e- 


45 


m 




28893147 | reflNP 796136.11 


RIKEN cDNA 9930032022 gene [Mu. . . 


185 


4e- 


45 


B 


qi 


55732638 | emb | CAH93018 . 1 | 


hypothetical protein [Pongo pyg. . . 


185 


4e- 


45 




qi 


13811665 |qb IAAK40233.1I 


coagulation factor XI [Mus muscu. . . 


185 


4e- 


45 


BE] 


qi 


26331892 (dbi IBAC2967 6. 1| 


unnamed protein product [Mus mu. . . 


185 


4e- 


45 


IE! 


q 1 


33695155 | reflNP 892028.11 


hepsin (transmembrane protease... 


185 


6e- 


45 




qi 


34864591|ref |XP 236201.21 
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prostasin preproprotein [Homo s... 


183 


2e- 


44 


m 


qi 


31206367 [ reflXP 312135.1| 


ENSANGP00000022018 [Anopheles . . . 
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qi'l 472256861 emb I CAG08029 . 1 1 unnamed protein product [Tetrao. . . 180 2e-43 

qi I 47522962 I ref I NP 999239.11 kallikrein [Sus scrofa] >gi|41... 179 2e-43 B 

qi I 2599504 | qb 1 AAB96905 . 1 1 serine .protease [Xenopus laevis] 179 2e-43 B 

qj|4504877|ref |NP 000883.11 plasma kallikrein Bl precursor . . . 179 3e-43 10 

qi|31222400|ref |XP 317173.1| ENSANGP00000018384 [Anopheles . . . 179 3e-43 B 

qi | 55237402 | qb | EAA12264 . 2 | ENSANGP00000018384 [Anopheles ga... 179 3e-43 

qi I 20142083 | dbj IBAB88921. 1 1 complement control protein fact... 17_9 4e-43 

qi I 42543248 | pdbl 1P57 | B Chain B, Extracellular Domain Of Hum. . . 179 4e-43 § 

qi I 116133 | sp| P05156I CFAI HUMAN Complement factor I precurso. . . 178 5e-43 19 

qi | 55623630 | ref | XP 526757.1] PREDICTED: plasma kallikrein B. . . 178 5e-43 

qi| 1335054 | emb | CAA68416.il unnamed protein product [Homo sa . . . 178 5e-43 El 

qi| 5572 9915 | emb | CAH91684. 1 | hypothetical protein [Pongo pyg. . . 178 5e-43 

qi | 41032 6 | emb | CAA80515 . 1 | trypsin [Anopheles gambiae] >gi|6... 178 5e-43 

qi | 50540258 1 ref |NP 001002596.11 zgc: 92313 [Danio rerio] >gi . . . 178 7e-43 B 

qi|32698841jref |NP 872308.1| hypothetical protein DKFZp686L. . . 177 9e-43 B 

qi 1 23312 17 | qb | AAB66878.il trypsin [Anopheles stephensi] 177 9e-43 

qi I 28202035 | ref |NP 780649.11 pancreasin [Mus musculus] >gi | . . . 177 le-42 B 

qi | 47507258 I qb I AAH70996 . 1| Unknown {protein for MGC:7992 8) ... 177 2e-42 

qi | 410330 | emb | CAA80512 . 1 | trypsin [Anopheles gambiae] >gi | 4 . . . 176 2e-42 

qi I 57097947 | ref |XP 532838.11 PREDICTED: similar to Plasma k. . . 176 2e-42 

qi| 413931051 ref |NP 958880.11 plasminogen [Danio rerio] >gi | . . . 176 2e-42 El 

qi | 410327 | emb | CAA80516. 1 | Trypsinogen precursor of ANTRYP7 . . . 176 2e-42 

qi | 25005112 1 qb | AAN71006 . 1 | plasminogen precursor [Danio rerio] 176 2e-42 B 

qi | 34980909 | qb | AAH57186 . 1 | Pig protein [Mus musculus] >gi | 1 . . . 176 3e-42 B 

qi I 56199540 | qb | AAV84259 . 1 1 serine protease [Culicoides sono. . . 176 3e-42 

qi | 38051823 | qb | AAH60513 . 1| Plasminogen [Homo sapiens] 176 3e-42 B 

qi | 55732729 | emb | CAH93062 . 1 | hypothetical protein [Pongo pyg... 176 3e-42 

qi | 55730120 | emb | CAH91784 . 1 | hypothetical protein [Pongo pyg... 176 3e-42 

qi | 557273361 emb | CAH90424. 1 | hypothetical protein [Pongo pyg... 176 3e-42 

qi| 99613531 ref I NP 062505.11 platelet coagulation factor XI ... 176 3e-42 El 

qi I 6671744 | ref |NP 031712.11 complement component factor i [... 176 3e-42 B 

qi I 4503627 | ref |NP 000119.11 platelet coagulation factor XI . . . 176 3e-42 El 

qi | 180352 | qb | AAA51985 . 1 | coagulation factor XI 176 3e-42 B 

qi I 24657332 | ref |NP 652645.1| CG18735-PA [Drosophila melanog. . . 176 3e-42 B 

qi | 311983 | emb | CAA79327 . 1 | trypsin [Anopheles gambiae] >gi | 4 . . . 175 5e-42 

qi|2144495|pir| | PLMS plasmin (EC 3.4.21.7) precursor - mous . . . 175 5e-42 B 

qi| 312223931 ref | XP 317172.1| ENSANGP00000018316 [Anopheles . . . 175 5e-42 B 

qi | 55237401 | qb | EAA12262 . 3 | ENSANGP00000018316 [Anopheles ga . . . 175 5e-42 

qi| 19032293 | dbj | BAB85634.il trypsinogen [Anguilla japonica] 175 6e-42 

qi I 2815618 | qb | AAB97887 . 1 1 plasminogen [Papio hamadryas] 175 6e-42 

qi | 190026 | qb | AAA36451 . 1 | plasminogen 175 6e-42 B 

qi I 37362124 | qb| AAQ91197.il prostasin [Mus musculus] >gi|191... 174 8e-42 B 

qi| 387 031|qb|AAA60124. 1| plasminogen 174 8e-42 B 

qi| 625234 | pir | | PLHU plasmin (EC 3.4.21.7) precursor [valida... 174 8e-42 B 

qi| 45058811 ref |NP 000292.11 plasminogen [Homo sapiens] >gi | . . . 174 8e-42 B 

qi I 31209815 | ref I XP 313874.11 ENSANGP00000011601 [Anopheles . . . 174 8e-42 B 

qi | 6137624 | pdb | 1BUI | B Chain B, Structure Of The Ternary Mic... 174 8e-42 111 

qi I 27806815 | ref I NP 77 6376. 1 1 plasminogen [Bos taurus] >gi 1 1 . . . 174 le-41 B 

qi I 34878557 | ref |XP 224872.21 similar to coagulation factor . . . 174 le-41 B 

gi | 54636853 | gb | EAL26256 . 1 | GA15058-PA [Drosophila pseudoobs... 174 le-41 
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174 le-41 
2e-41 
173 2e-41 



qi I 41055921 1 ref INP 956439.11 similar to adrenal secretory s... 174 le-41 

gi | 47227752 lembl CAG08915.il unnamed protein product [Tetrao... 174 le-41 

qil6435717|pdb|lQRZjD Chain D, Catalytic Domain Of Plasmino... 174 

qil57088539lref IXP 547173.11 PREDICTED: similar to EOS prot... 173 

ail 55732953 lembl CAH93164.il hypothetical protein [Pongo pyg... ill 

gj|27573762|pdb|lL4Z|A Chain A, X-Ray Crystal Structure Of . . . 173 2e-41 g 

qil27573760|pdb|lL4D|A Chain A, Crystal Structure Of Microp... 173 2e-41 H 

ail 7245727 ipdbllDDJID Chain D, Crystal Structure Of Human P. . . 173 2e-41 &t 

ail 5821851 Ipdbl 1BML | B Chain B, Complex Of The Catalytic Dom. . . 173 2e-41 H 

gill30314lsp|P80009IPLMN CAN FA Plasminogen 111 2 *~\\ 

ail 57031802 I ref IXP 533468.11 PREDICTED: hypothetical protei . . . 173 2e-41 

gi I 31242535 1 ref 1 XP 321698.11 ENSANGP00000015747 [Anopheles ... 173 2e-4lH 

gi I 47523468 I ref I NP 999356.11 tryptase [Sus scrofa] >gi|1820... 173 2e-41 B 

gj|56967288lpdb|lXX9|B Chain B, Crystal Structure Of The Fx... 173 2e-41 11 

gi 1 10441463 I qb | AAG17054.il channel activating protease 1 [M. . . 173 2e-41 El 

gi 1 1698668 I gb I AAB37261 . 1 1 late trypsin precursor [Culex pip... 173 ^e-41 

gi|136332031sp|Q9ESDHPSS8 MOUSE Prostasin precursor (Chann. . . 173 2e^41 

gill30315lsp|P80010IPLMN HORSE Plasminogen ill 3e-41 

gi I 52219018 I ref I NP 001004582.11 zgc: 92292 [Danio rerio] >gi . . . 172 3e-41 IH 

ail 138988411 gb I AAK48894.il CUB-serine protease [Panulirus a... 172 3e _^ 

ail 5564 lembl CAA45715.il put. trypsin [Aedes aegypti] >gi I 67 . . . 172 3e 41 

gi 1 3006086 lembl CAA75311.il trypsin [Litopenaeus vannamei] 172 3e 41 

gj|2118100lpir| 1A61545 plasmin (EC 3.4.21.7) precursor - ho... 172 3e 41 

gi I 15042732 1 gb IAAK82432.il factor XI [Oryctolagus cuniculus] 172 



3e-41 



172 4e-41 



gi I 5295890 lembl CAB46014. 1 1 plasminogen protein [Rattus norv. 

qi I 57 086269 I ref | XP 546520.11 PREDICTED: similar to transmem. . . 172 4e^41 

gil 32402373 I qb | AAP81159. 1 1 trypsinogen [Pangasius hypophtha... 172 4e-41 

qil 55240943|qb| EAA09162 . 2 | ENSANGP00000011601 [Anopheles ga. . . 172 4e-41 

gil 480988221 refl XP 394832.11 similar to ENSANGP00000012201 ... 172 4e-41 

qil 47222714 lembl CAG00148. 1 | unnamed protein product [Tetrao... 172 4e 41 

qi I 2815616 1 qb IAAB97886.il apolipoprotein a [Papio hamadryas] 172 4e-41 

qi 156967297 I pdb 1 1XXF |B Chain B, Crystal Structure Of The Fx... 172 5e-41 

qil 199227141 ref |NP 611611.11 CG4386-PA [Drosophila melanoga. . . 171 7e-41 
qil 521390881 qb | AAH82670. 1| Xesp-1 protein [Xenopus laevis] 
qil 1272431 Iqb | A7UV97479.il Astrypl 

qi | 39654880 I pdb 1 1RJX | B Chain B, Human Plasminogen Catalytic. 
qil 67597 | pir | I KFBO coagulation factor IXa (EC 3.4.21.22) pr. 

qil 383721731 sp I Q95ND71 FA9 PANTR Coagulation factor IX precu... 171 9e 41 

qil2134456lpirl 1146260 plasmin (EC 3.4.21.7) precursor - we... 171 9e-41 

qi|86639|pirl IB30848 plasmin (EC 3.4.21.7) precursor - rhes... 171 le-40 

qi I 57097943 I ref I XP 532836.11 PREDICTED: similar to Coagulat... 171 le 40 

qil 56270387 I qb | AAH87611.il Unknown (protein for MGC: 97869) ... 171 le-40 

gil 67596 I pir I I KFHU coagulation factor IXa (EC 3 . 4 . 21 . 22 ) pr . . . 170 le-40 B 

qil 47216479 lembl CAG02130.il unnamed protein product [Tetrao... 170 le 40 

qi I 6650672 1 gb I AAF21966 . 1 1 heart specific serine proteinase . . . 170 le-40 g 

qi I 4503649 1 ref I NP 000124.11 coagulation factor IX [Homo sap... 169 2e-40 j£j 

qi l 9757698-1 dbj 1 BAB08216. 1 1 embryonic serine protease-1 [Xen. . . 169 2e-40 El 

qi 1 57 087141 1 ref 1 XP 536782.11 PREDICTED: similar to chymotry, . . 169 3e-40 

qil 38512040 I qb I AAH61083 . 1| Chymotrypsinogen Bl [Mus musculu. . . 169 3e-40 El 

qi 1 50806652 I ref 1 XP 424472.11 PREDICTED: similar to proacros... 169 3e-40 El 

qi I 4210804 1 emb I CAA76929 . 1 1 trypsin [Phaedon cochleariae] 169 3e 40 

qi I 21703806 | ref | NP 663378.11 transmembrane protease, serine... 169 4e-40 IH 
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171 7e-41 

171 7e-41 

171 9e-41 
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45758733|gb|AAS76646.1| blood coagulation factor IX [Bos... 


169 


4e- 


■40 IE] 


qi| 


56710319 | reflNP 001008665.11 blood coaqulation factor IX... 


169 


4e- 


•40 




50746887 | ref|XP 426329.11 PREDICTED: similar to Compleme . . . 


169 


4e- 


-40 l£] 


qil 


481401731 ref|XP 397087.1! similar to Astrypl [Apis melli... 


169 


4e- 


■40 IE] 


qi i 


5441859 |dbj | BAA82365. 1 | chymotrypsinogen 1 [ Paralichthys . . . 


169 


4e- 


■40 


qi 


34870498 | ref|XP 220222.2) similar to serine protease [Ra... 


168 


6e- 


-40 Ifil 


qi 


54636854 |gb| E7VL26257.il GA18150-PA [Drosophila pseudoobs... 


168 


6e- 


-40 


qi 


13385032 | ref|NP 079859.11 chymotrypsinogen Bl [Mus muscu. . . 


168 


6e- 


-40 la 


qi 


3559980 | emb | CAA06644. 1 | serine protease [Rattus rattus] 


168 


6e- 


-40 


qi 


6572446 | emb | CAB63112 . 1 | serine protease [ Pacif astacus le... 


168 


6e- 


-40 


qi 


182609|gb|AAA98726. 1| factor IX 


168 


6e- 


-40 la 


qi 


57088545 | ref|XP 547176.11 PREDICTED: similar to protease... 


168 


7e- 


-40 


qi 


5525101 |dbi | BAA82522 . 1 | tunicate retinoic acid-inducible . . . 


168 


7e- 


■40 


qi 


56611133|qb|AAH87787.1| Unknown (protein f or MGC : 108057 ).. . 


167 


9e- 


-40 


qi 


55637037 | ref|XP 508786.11 PREDICTED: similar to Transmem. . . 


167 


9e- 


•40 


qi 


19698564 | qb | AAL93209 . 1 | early trypsin-like protein precu. . . 


167 


9e- 


•40 


qi 


6911219 |qb IAAF31436.il type II membrane serine protease ... 


167 


le- 


■39 Q 


qi 


37182183 |qb IAAQ88894.il TMPRSS3 [Homo sapiens] 


167 


le- 


-39 B 


qi 


15451940 | ref|NP 063947.11 transmembrane protease, serine... 


167 


le- 


-39 la 


qi 


5562 | emb | CAA45714 . 1 | put. trypsin [Aedes aegypti] >gi | 67 . . . 


167 


le- 


•39 


qi 


26347427 | dbi | BAC37362 . 1 | unnamed protein product [Mus mu. . . 


167 


le- 


-39 E3 


qi 


108088 |pir| IA21195 chymotrypsin (EC 3.4.21.1) 2 precurso. . . 


167 


2e- 


•39 


qi 


105012 |pir| IA32869 apolipoprotein (a) (EC 3.4.21.-) - rhe . . . 


167 


2e- 


-39 


qi 


12841192 | dbi (BAB25112. 1 | unnamed protein product [Mus mu. . . 


167 


2e- 


-39 la 


qi 


11545839 | ref |NP 071402.11 protease, serine, 22 [Homo sap... 


166 


2e- 


■39 B 


qi 


37181917 |qb IAAQ88762.il PRSS22 [Homo sapiens] 


166 


2e- 


-39 la 


qi 


21070351 |qb IAAM34268.il early trypsin [Aedes aegypti] 


166 


2e- 


-39 


qi 


47212882 | emb | CAF90331 . 1 | unnamed protein product [Tetrao. . . 


166 


2e- 


-39 


qi 


2734092 | qb|AAB93671 . 1 | SP001LA [Homo sapiens] 


166 


2e- 


-39 El 


qi 


30230633 |qb IAAP20885.il tryptase-6 [Mus musculus] 


166 


3e- 


Bl 

-39 la 


qi 


57164273|ref |NP 001009412.11 tryptase [Ovis aries] >qi | 4 . . . 


166 


3e- 


-39 la 


qi 


31324554 | ref |NP 852142.11 protease, serine, 21 [Rattus n. . . 


166 


3e- 


. . Bl 
-39 la 


qi 


55240925 | qb | EAA09284 . 2 | ENSANGP00000012216 [Anopheles ga... 


166 


3e- 


-39 


qi 


50979168 | ref |NP 001003323.11 coaqulation factor IX [Cani... 


166 


4e- 


-39 la 


qi 


57105368 | ref |XP 543288.11 PREDICTED: similar to nicotini... 


166 


4e- 


-39 


qi 


24581459 | ref |NP 722915.11 CG31954-PA [Drosophila melanoq. . . 


166 


4e- 


-39 B 


qi 


41054047 | ref | NP 956180.11 chymotrypsin C (caldecrin) [Da... 


166 


4e- 


Bl 

-39 la 


qi 


326989401 ref |NP 872365.1| hypothetical protein MGC57341 ... 


166 


4e- 


-39 la 


qi 


49119431 |qb IAAH73506.il MGC82715 protein [Xenopus laevis] 


166 


4e- 


^ a Bl 

-39 la 


qi 


56208061|emb|CAI21060.1| novel elastase protein (zgc:637... 


166 


4e- 


-39 


qi 


14009271|qb|AAK50138.1| early trypsin [Culex pipiens qui... 


166 


4e- 


-39 


qi 


67552 | pir | | TRDG trypsin (EC 3.4.21.4) precursor, anionic... 


165 


5e- 


-39 


qi 


50318851 ref |NP 005568.11 lipoprotein, Lp(a) [Homo sapien. . . 


165 


5e- 


~. /-x Bl 
-39 la 


qi 


246482451 ref |NP 650825.11 CG7432-PA [Drosophila melanoqa. . . 


165 


5e- 


Bl 

-39 la 


qi 


55237399 | qb| EAL39600. 1 | ENSANGP00000026990 [Anopheles qa... 


165 


5e- 


-39 


qi 


56203915 |emb| CAI22905.il LPA [Homo sapiens] >qi | 55665907 .. . 


165 


5e- 


-39 


qi 


559508 1 emb |CAA57701.1| trypsin [ Paranotothenia magellani... 


165 


5e- 


-39 


qi 


480984091 ref |XP 394066.11 similar to CG9372-PA [Apis mel . . . 


165 


5e- 


R9 

-39 la 


qi 


225794 |prf| | 1313352A apolipoprotein a 


165 


5e- 


-39 


qi 


6978717 | ref |NP 036668.1| Chymotrypsinoqen B [Rattus norv. . . 


165 


6e- 


Bl 

-39 la 


qi 


410329 | emb | CAA80518 . 1 | trypsin [Anopheles gambiae] >gi | 3 . . . 
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Si 
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Si 
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qi 
qi 



qi 
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Si 
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11527822 |qb|AAG37012. 1 | serine protease TADG12 [Homo sap. 
20142081 [ dbj I BAB88920. 1 1 complement control protein fact. 
56681219 |qb| AAW21246. 1 | digestive serine protease II [Ma. 
4098568 I qb | AAD00320 . 1 1 plasminogen activator sPA [Scolop. 
5441853 |dbj |BAA82362.1| trypsinogen 1 [Paralichthys oliv. 

216145311 ref|NP 659205.1| testisin isoform 2 [Homo sapie. 
54639795 Iqb I EAL29197 . 1 1 GA20347-PA [Drosophila pseudoobs. 

481304501 refjXP 393316.11 similar to CG8213-PA [Apis mel . 



47220856|emb|CAG00063. 1 | unnamed protein product [Tetrao. 
47214192 1 emb | CAG00820 . 1 | unnamed protein product [Tetrao. 
4753837 | emb | CAB41989 . 1 | tryptase [Ovis aries] >gi 1182036. 
67550 1 pir I | TRPGTR trypsin (EC 3.4.21.4) precursor - pig . 
13516893 I dbj 1 BAB40330 . 1 1 trypsinogen II [Engraulis japon. 



5822391 1 pdbl 1RFN | A Chain A, Human Coagulation Factor Ixa. 
11055972 I ref|NP 065233.21 tryptase 4 [Mus musculus] >gi I . 
17538534|ref |NP 501379.11 serine protease 22D (41977) [C. 



29612490 I qb| AAH49588 . 1 1 Prss21 protein [Mus musculus] 
54418551 dbj I BAA82363.il trypsinogen 2 [Paralichthys oliv. 
1363091|pir| [S56160 mast cell tryptase precursor - Mongo . 
54645028 | qb | EAL337 68 . 1 | GA16585-PA [Drosophila pseudoobs. 



51247 093 I pdb 1 1H9I | E Chain E, Complex Of Eeti-Ii Mutant W. 
47214193 I emb | CAG00821 . 1 1 unnamed protein product [Tetrao. 



10835849|pdb| 1FNI |A Chain A, Crystal Structure Of Porcin. 

4506147 I ref [NP 002761.11 protease, serine, 2 preproprote. 

33126583 | qb [ AAL14244 . 1 | protease serine 2 isoform B [Horn. 

16758930|ref [NP 446461. H chymotrypsin-like [Rattus norv. 

47086795|ref |NP 997783.1| chymotrypsinogen Bl [Danio rer. 
50604052 |qb|AAH78367. 1| Unknown (protein for IMAGE: 70385. 



3318722 I pdb 1 1AN1 1 E Chain E, Leech-Derived Tryptase Inhib.. 
64386 I emb I CAA4967 8.il trypsin II [Salmo salar] >gi 142258.. 

27465583 I ref |NP 775150.11 cationic trypsinogen [Rattus n. . 

31238307 [ref 1XP 319744.11 ENSANGP00000017160 [Anopheles .. 

45550384 I ref |NP 610435.31 CG8213-PA [Drosophila melanoga . . 
55235308 | qb | EAA14903 . 2 | ENSANGP00000017160 [Anopheles ga . . 
18447345 | qb | AAL68238 . 1 | LD43328p [Drosophila melanogaster] 

13094662 lqb|AAK11956. 1 | neonatal thrombolytic agent alph. . 

19354409 |qb| AAH24374. 1 | Mcpt6 protein [Mus musculus] >gi . . 

3718318 6 |qb I AAQ89393. 1| PRSS21 [Homo sapiens] >gi|580319.. 

30583551 1 qb IAAP36020. 1 1 chymotrypsinogen Bl [Homo sapien. . 

56972040 | qb j AAH87 918 . 1 1 Chymotrypsin-like [Mus musculus] . . 
30584037 | qb | AAP36267 . 1 | Homo sapiens chymotrypsinogen Bl . . 
54641961 |qb I EAL307 10. 1| GA18522-PA [Drosophila pseudoobs.. 

23593662 I ref |XP 128541.21 RIKEN cDNA 4931440B09 [Mus mus.. 
47220857 | emb | CAG00064 . 1 | unnamed protein product [Tetrao.. 
2443296 I dbj | BAA22400. 1 1 26kDa protease [Sarcophaga pereg.. 

12843559 | dbj | BAB26029 . 1 | unnamed protein product [Mus mu. . 
32394742 | qb | AAM96942 . 1 | trypsin 3 [Phlebotomus papatasi] 

28274638 I qb | AAO34406 . 1 1 tissue plasminogen activator; TP. . 



qi 



42543835 |pdb| 1UTM|A Chain A, Trypsin Specificity As Eluc. . 
13183620 I qb I AAK15274 . 1 1 sea star regeneration-associated.. 

I 971196 I qb I AAA75001.1j trypsinogen 
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34784896 |qb IAAH56849.1I MGC64417 protein [Xenopus laevis] 
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9e- 


38 


Ed 


qi 


34866384 | ref|XP 217270. 2| similar to testis serine prote . . . 


161 


9e- 


38 


l£] 


qi 


57097397 | ref | XP 532744.11 PREDICTED: similar to trypsin ... 


161 


. 9e- 


38 




qi 


56681217 |qb IAAW21245.il diqestive serine protease I [May... 
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le- 


37 




qi 


14702169) ref |NP 127509.11 plasminogen activator, tissue ... 


160 


le- 


37 


IE! 


qi 


423498 |pir| IA47246 tryptase (EC 3.4.21.59) 2 - mouse >gi . . . 


160 


le- 


37 


■a 

lEI 


qi 


441174 | dbj | BAA00881 . 1 | tissue plasminogen activator [Horn... 


160 


le- 


37 


ta 


qi 


4505861 | ref |NP 000921.11 plasminogen activator, tissue t... 


160 


le- 


37 


IE] 


qi 


33877196|gb| AAH02795.2 | PLAT protein [Homo sapiens] 


160 


le- 


37 


l£] 


qi 


33872180|gb|AAH18636.2| PLAT protein [Homo sapiens] >gi | . . . 


160 


le- 


37 


El 


qi 


21357887 | ref |NP 648711.11 CG4914-PA [Drosophila melanoga... 


160 


le~ 


37 


19 
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qi 


30583865 | gb | AAP36181 . 1 | Homo sapiens plasminogen activat... 
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le- 


37 




qi 


55630632 | ref |XP 528120.11 PREDICTED: similar to PLAT pro... 
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le- 


37 




qi 


31981624 | ref |NP 112464.21 mast cell protease 7 [Mus muse... 
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le- 


37 


19 
l£] 


qi 


52551331|gb|AAU84664.1| trypsin [Oreochromis niloticus] 
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le- 


37 




qi 


339834 | gb IAAB59510 . 1 | plasminogen activator [Homo sapiens] 
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le- 


37 


19 


qi 


67572 |pir M KYBOB chymotrypsin (EC 3.4.21.1) B precursor ... 
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le- 


37 
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El 


qi 


785035 | emb | CAA60129. 1 | trypsin [Litopenaeus vannamei] >g... 
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le- 


37 




qi 


3006082 | embj CAA75309. 1 | trypsin [Litopenaeus vannamei] 
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le- 


37 




qi 


574151 emb ICAA41752.il trypsin V b-form [Rattus rattus] >... 
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37 




qi 


556315901 ref |XP 528278.11 PREDICTED: similar to tryptoph. . . 
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37 




qi 


54641711 | gb| EAL30461. 1 | GA21737-PA [Drosophila pseudoobs... 
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37 




qi 


49115873 | gb|AAH73613. 1 | MGC82927 protein [Xenopus laevis] 
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2e- 


37 
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El 


qi 


50753637 | ref |XP 425105.1| PREDICTED: similar to chymotry. . . 
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2e- 


37 


19 
IE] 


qi 


5921501 | emb | CAB56465 . 1 | distal intestinal serine proteas... 
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37 
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El 


qi 


38383098|gb|AAH62334.1| Unknown (protein for MGC:71136) ... 
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2e- 


37 
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qi 


34868742 | ref |XP 220209.2 1 similar to tryptase 5 [Rattus ... 
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37 
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El 


qi 


56971185|gb|AAH88769. 1| Unknown (protein f or MGC : 108128 ).. . 
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qi 


20988034 |gb IAAH30260.1I PRSS2 protein [Homo sapiens] 
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qi 


54645404 |gb IEAL34144.il GA17401-PA [Drosophila pseudoobs... 
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37 




qi 


47132444 | gb|AAT11803. 1 | pancreatic trypsinogen [Struthio... 
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37 




qi 


26006097 | dbi | BAC41392 . 1 | anionic trypsin [Oncorhynchus k. . . 
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37 




qi 


39587498 1 emb ICAE58436.il Hypothetical protein CBG01571 [... 
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2e- 


37 




qi 


27710074 | ref | XP 231718.11 similar to Trypsin V-A precurs . . . 
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El 


qi 


31207113|ref|XP 312523.11 ENSANGP00000014938 [Anopheles ... 
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37 


19 


qi 


23097244 | ref |NP 690851.11 protease, serine, 33 [Homo sap... 
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37 
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El 


qi 


24581698 | ref |NP 608848.1| CG3355-PA [Drosophila melanoga... 
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37 


19 

El 


qi 


33126535 | gb|AAL14243. 1 | protease serine 4 isoform B [Horn... 
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37 




qi 


21536452 | ref | NP 002762.2| mesotrypsin preproprotein [Horn... 
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3e- 


37 


19 
El 


qi 


55242354 | gb | EAA07518 . 2 | ENSANGP00000014938 [Anopheles ga... 
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3e- 
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qi 


52551333 |gb |AAU84665.1| trypsin [Oreochromis aureus] 
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3e- 


37 




qi 


20988417|gb|AAH30238.1| Unknown (protein for IMAGE : 45379 .. . 
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3e- 


37 


19 
El 


qi 


33416522 | gb IAAH55854 . 1 | Protease, serine, 22 [Mus musculus] 


159 


3e- 


37 


19 
El 


qi 


135168911 dbi IBAB40329.il trypsinogen [Engraulis japonicus] 
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3e- 


37 




qi 


4868339 | gb IAAD31269 . 1 | trypsinogen RdoT3 precursor [Rhyz... 


159 


3e- 


37 




qi 


60663781 emb ICAB58178.il trypsinogen IV a-form [Homo sapi . . . 
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3e- 


37 


19 

El 


qi 


3452120 | gb IAAC32752 . 1 | trypsinogen 2 precursor [Pseudopl... 
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3e- 


37 




qi 


7960528) emb |CAB91840.1| serine protease [Anopheles gambiae] 
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3e- 


37 




qi 


423134|pir| IS33496 trypsin (EC 3.4.21.4) IV form a - hum... 
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3e- 


37 


19 

El 


qi 


6980535|pdb|2STB|E Chain E, Anionic Salmon Trypsin In Co... 
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3e- 


37 


11 


qi 


57086241 1 ref | XP 546505.1| PREDICTED: similar to Transmem. . . 
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37 
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47078378 I qb I AAT09850. 1| 



2305256|qb|AAB65760. lj 



trypsin-like serine proteinase [. 
plasminogen [Macropus eugenii] >g. 

1827742 |pdb| 1PFX|C Chain C, Porcine Factor Ixa 
556445251 ref|XP 523475.11 "PREDICTED: similar to TPS1 pro. 
3006084 | emb | CAA75310 . 1 1 trypsin [Litopenaeus vannamei] 
41350551 | qb| AAS 00516. 1 1 trypsin [Oreochromis aureus] 

514730391 ref|XP 496169.11 PREDICTED: similar to Chymotry. 



1552516|qb|AAC80208. 1[ trypsinogen C [Homo sapiens] 
643821 emb ICAA49676. 11 trypsin IA [Salmo salar] >gi| 42258. 
643801 emb|CAA49680. 1] trypsin I [Salmo salar] >gi 1422581. 

6981420 1 ref | NP 036767.11 pancreatic trypsin 1 [Rattus no. 

19353198 | qb| AAH24903. 1 | Prss32 protein [Mus musculus] 
49256410 | qb | AAH73145. 11 CTRB1 protein [Homo sapiens] 

24496768 | ref |NP 038949.21 transmembrane protease, serine. 

30704873 | qb| AAH51852 . 1 | TPSAB1 protein [Homo sapiens] 

20380845 | qb | AAH28059 . 1 | TPSAB1 protein [Homo sapiens] >g. 

208990301 ref |XP 128466.11 tryptase 5 [Mus musculus] 

51711962 | ref |XP 487918.11 PREDICTED: similar to trypsino. 

51010909 | ref |NP 001003405.11 RIKEN cDNA 1810049H19 gene . 



468751 | emb | CAA55242 . 1 | chymotrypsin; prechymetrypsinogen. 



809221 | pdb | 1BIT | The Crystal Structure Of Anionic Salmo. 
47225581 1 emb | CAG12064 . 1 | unnamed protein product [Tetrao. 

30421098 | qb| AAP23216. 1 | tryptase 5 [Mus musculus] 



18158713|pdb|lHJ8|A Chain A, 1.00 Aa Trypsin From Atlant. 

24582982 | ref |NP 523518.21 CG9564-PA [Drosophila melanoga. 
57162224 | emb | CAI39655 . 1 | OTTHUMP00000045395 [Homo sapien. 
38 678694 Iqb | AAR26346. 1 | factor IX [Felis catus] >gi|5716. 

37182852 | qb | AAQ8 922 6. 1 | tryptophan/serine protease [Homo. 

47086363[ref | NP 998000.11 elastase 2 like [Danio rerio] . 

45383277 | ref |NP 989773.11 coagulation factor VII [Gallus. 

48098818 | ref 1XP 394830.1| similar to CG4914-PA [Apis mel . 



14270123 I dbj |BAB58886.1| coagulation factor XI [Pan trog. 



20149993 | pdb | 1H4W | A Chain A, Structure Of Human Trypsin . 
12840814 | dbj | BAB24967 . 1 | unnamed protein product [Mus mu. 
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Alignments 




□ > qi|11415040|ref | NP 068813.11 B matriptase [Homo sapiens] 
qi[ 20988875 [qb | AAH30532. 11 0 Matriptase [Homo sapiens] 
qi | 6647302 | qb | AAD427 65 . 2 [ B matriptase [Homo sapiens] 

qi | 6002714 | gb 1 AAF00109 . 1 | B membrane- type serine protease 1 [Homo sapiens] 

qi 1 13124575 | sp| Q9Y5Y61 ST14 HUMAN B Suppressor of tumorigenicity 14 (Matriptase) (M 
protease 1) (MT-SP1) (Prostamin) (Serine protease 
TADG-15) (Tumor associated differentially-expressed 
gene-15 protein) 
Length = 855 
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Score = 1698 bits (4397), Expect =0.0 

Identities = 805/855 (94%), Positives = 805/855 (94%) 

Query: 1 MGS DRARXXXXXXXXXXXXXX YNS RHE KVNGLEE GVE FL PVNNVKKVE KHG PGRWXXXXX 60 

MGSDRAR YNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRW 
Sbjct: 1 MGS DRARKGGGGPKDFGAGLKYNS RHE KVNGLEE GVE FL PVNNVKKVE KHG PGR WVVLAA 60 

Query: 61 XXXXXXXXXXXXXXXXWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 120 

WHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 
Sbjct: 61 VLIGLLLVLLGIGFLVWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 120 

Query: 121 KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXX 180 

KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHL 
Sbjct: 121 KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLVEEAERVMAEERWM 180 

Query: 181 LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 240 

LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 
Sbjct: 181 LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 240 

Query: 241 HARCQWALRGDADSVLSLTFRS FDLASCDERGSDLVTVYNTLS PME PHALVQLCGTYPPS 300 

HARCQWALRGDADSVLS LT FRS FDLASCDERGSDLVTVYNTLS PME PHALVQLCGTY PPS 
Sbjct: 241 HARCQWALRGDADSVLSLTFRS FDLASCDERGSDLVTVYNTLS PME PHALVQLCGTY PPS 300 

Query: 301 YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 360 

YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 
Sbjct: 301 YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 360 

Query: 361 PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 420 

PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 
Sbjct: 361 PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 42 0 

Query: 421 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 480 

NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 
Sbjct: 421 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 480 

Query: 481 SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 540 

SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 
Sbjct: 481 SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 540 

Query: 541 SQQCNGKDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 600 

SQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 
Sbjct: 541 SQQCNGKDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 600 

Query: 601 DCDCGLRS FTRQARWGGTDADEGEWPWQVSLHALGQGHI CGAS LI S PNWLVSAAHC Y I D 660 

DCDCGLRS FTRQARWGGTDADE GEWPWQVS LHALGQGH I CGAS LIS PNWLVSAAHC Y I D 
Sbjct: 601 DCDCGLRS FTRQARWGGTDADEGEWPWQVS LHALGQGH I CGAS LI S PNWLVSAAHCY ID 660 

Query: 661 DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 720 

DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRI I SHPFFNDFTFDYDIALLELEKP 
Sbjct: 661 DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRI I SHPFFNDFTFDYDIALLELEKP 720 

Query: 721 AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 780 

AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 
Sbjct: 721 AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 780 

Query: 781 PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYT 840 

PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYT 
Sbjct: 781 PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYT 840 

Query: 841 RLPLFRDWIKENTGV 855 
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RLPLFRDWIKENTGV 
Sbjct: 841 RLPLFRDWIKENTGV 855 



□ > qi 1 12249015 | dbj IBAB20376. 1| 0 prostamin [Homo sapiens] 
Length = 855 

Score = 1696 bits (4393), Expect =0.0 

Identities = 804/855 (94%), Positives = 804/855 (94%) 

Query: 1 MGSDRARXXXXXXXXXXXXXXYNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWXXXXX 60 

MGSDRAR YNSRHEKVNGLEEGVE FLPVNNVKKVEKHGPGRW 

Sbjct: 1 MGSDRARKGGGGPKDFGAGLKYNSRHEKWGLEEGVEFLPVNNVKKVEKHGPGRWVVLAA 60 

Query: 61 XXXXXXXXXXXXXXXXWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 12 0 

WHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 
Sbjct: 61 VLIGLLLVLLGIGFLVWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 120 

Query: 121 KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXX 180 

KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHL 
Sbjct: 121 KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLVEEAERVMAEERVVM 180 

Query: 181 LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 240 

LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 
Sbjct: 181 LPPRARSLKS FWTSWAFPTDSKTVQRTQDNSCS FGLHARGVELMRFTTPGFPDS PYPA 240 

Query: 241 HARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPS 300 

HARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPS 
Sbjct: 241 HARCQWALRGDADSVLS LT FRS FDLAS CDERGSDLVTVYNTLS PME PHALVQLCGT Y PPS 300 

Query: 301 YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 360 

YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 
Sbjct: 301 YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 360 

Query: 361 PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFVVTS 420 

PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 
Sbjct: 361 PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 420 

Query: 421 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 480 

NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 
Sbjct: 421 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 48 0 

Query: 481 SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 540 

SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 
Sbjct: 481 SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 540 

Query: 541 SQQCNGKDIX:GDGSDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 600 

SQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 
Sbjct: 541 SQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 600 

Query: 601 DCDCGLRSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYID 660 

DCDCGLRS FTRQARWGGTDADE GEWPWQVS LHALGQGHI CGAS LI S PNWLVS AAHC Y I D 
Sbjct: 601 DCDCGLRS FTRQARWGGTDADE GEWPWQVS LHALGQGHI CGAS LI S PNWLVS AAHC Y I D 660 

Query: 661 DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 720 

DRGFRYSDPTQWT FLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 
Sbjct: 661 DRGFRYSDPTQWTVFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 720 

Query: 721 AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 780 
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AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 
Sbjct: 721 AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 780 

Query 781 PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYT 840 

PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRI FQAGWSWGDGCAQRNKPGVYT 
Sbjct: 781 PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRI FQAGWSWGDGCAQRNKPGVYT 840 

Query: 841 RLPLFRDWIKENTGV 855 

RLPLFRDWIKENTGV 
Sbjct: 841 RLPLFRDWIKENTGV 855 



11 > qi I 10257390 I qb I AAG15395 . 1 1 B serine protease TADG15 [Homo sapiens] 
Length =855 

Score = 1696 bits (4391), Expect = 0.0 

Identities = 804/855 (94%), Positives = 804/855 (94%) 

Query 1 MGSDRARXXXXXXXXXXXXXXYNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWXXXXX 60 

MGSDRAR YNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRW 
Sbjct: 1 MGSDRARKGGGGPKDFGAGLKYNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWVVLAA 60 

Query 61 XXXXXXXXXXXXXXXXWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 120 

WHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 
Sbjct: 61 VLIGLLLVLLGIGFLVWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 120 

Query: 121 KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXX 180 

KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHL 
Sbjct: 121 KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLVEEAERVMAEERWM 180 

Query 181 lPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 240 

LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 
Sbjct: 181 LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 240 

Query 241 HARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPS 300 

HARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPS 
Sbjct: 241 HARCQWALRGDADS VLS LT FRS FDLAS CDERGSDLVTVYNT LS PME PHALVQLCGT Y PPS 300 

Query 301 YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 360 

YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 
Sbjct: 301 YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 360 

Query 361 PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 420 

PPNIDCTWNIEVPNNQHVKV FKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 
Sbjct: 361 PPNIDCTWNIEVPNNQHVKVSFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 420 

Query 421 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 480 

NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 
Sbjct: 421 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 480 

Query 481 SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 540 

SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 
Sbjct: 481 SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 540 

Query 541 SQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 600 

SQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 
Sbjct: 541 SQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 600 

Query: 601 DCDCGLRSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYID 660 
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DCDCGLRS FTRQARWGGTDADE GEWPWQVS LHALGQGHI CGAS LI S PNWLVSAAHC Y I D 
Sbjct: 601 DCDCGLRS FTRQARWGGTDADE GEWPWQVS LHALGQGHI CGAS LI S PNWLVSAAHC Y ID 660 

Query: 661 DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 720 

DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 
Sbjct: 661 DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 720 

Query: 721 AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 780 

AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 
Sbjct: 721 AEYSSMVRPICLPDASHVFPAGKMWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 780 

Query: 781 PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYT 840 

PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYT 
Sbjct: 781 PQQI T PRMMCVGFLS GGVDS CQGDS GGPLS S VE ADGR I FQAGWSWGDGCAQRNKPGVYT 840 

Query: 841 RLPLFRDWIKENTGV 855 

RLPLFRDWIKENTGV 
Sbjct: 841 RLPLFRDWIKENTGV 855 



□ > qi I 55637345 | ref|XP 508863.11 PREDICTED: similar to matriptase; suppression of t 
(colon carcinoma) ; membrane-type serine protease; serine 
protease TADG-15; tumor associated differentially 
expressed gene 15 protein [Pan troglodytes] 
Length = 1153 

Score = 1444 bits (3739), Expect =0.0 

Identities = 693/809 (85%), Positives = 705/809 (87%), Gaps = 29/809 (3%) 



Query: 24 SRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWXXXXXXXXXXXXXXXXXXXXXWHLQYRD 8 3 

S+ +KVNGLEEGVEFLPVNNVKKVEKHGPGRW WHLQYRD 
Sbjct: 201 SKGKKVNGLEEGVEFLPVNNVKKVEKHGPGRWVVLAAVLIGLLLVLLGIGFLVWHLQYRD 260 

Query: 84 VRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESAV 143 

VRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESAV 
Sbjct: 261 VRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESAV 320 

Query: 144 TAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWAF 199 

TAFSEGSVIAYYWSEFSI PQHL LPPRARSLKSFWTSWAF 
Sbjct: 321 TAFSEGSVIAYYWSEFSIPQHLVEEAERVMAEERVVMLPPRARSLKSFVVTSVVAFRESE 380 

Query: 200 PTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDS 236 

TDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDS 
Sbjct : 381 GQGWAWDWPAFHGEMLYSPSYSSATDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDS 440 

Query: 237 PYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPME PHALVQLCGT 296 

PY PAHARCQWALRGDADSVLS LT FRS FDLAS CDERGSDLV VYNTLSPME PHALVQLCGT 
Sbjct: 441 PY PAHARCQWALRGDADSVLS LT FRS FDLAS CDERGSDLVMVYNTLS PME PHALVQLCGT 500 

Query: 297 YPPSYNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYY 356 

YPPSYNLTFHSSQNVLL+TLITNT RRHPGFEATFFQLPRM SCGGRLRKAQGTFNSPYY 
Sbjct: 501 Y PPSYNLT FHS S QNVLLVTL I TNT RRRHPGFEATFFQLPRMR SCGGRLRKAQGTFNSPYY 560 

Query: 357 PGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQF 416 

PGH Y P PNI DCTWN I E VPNNQHVKVR FKF F Y LLE PGVPAGTC PKD Y VE I NGE KY CGE R S Q F 
Sbjct: 561 PGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQF 620 

Query: 417 WTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWAD 476 
WTSNSNKITVRFHSDQSYTDTGFLAEY+SYDSSDPCPGQFTCRTGRCIRKELRCDGWAD 
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QM T TTH Q nn Cl YT nT flPT . Zi TT V V Q V 
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Query: 


Ml 


CTDHSDELNCSCDAGHQFTCKNKFCKPLEWCDSVNDCGDNSDEQGCSCPAQTFRCSNGK 


536 






r*Tr%HQriFT.MPc;r*nix T-rpjPT'PK'NrK'PPK'DT pwvpnQVMnp^nM^npnriP^ppnnTPRP^Wf^K 




ODJ CT- . 


eft i 


pTnwQniTT Mp^pnzicjupiPTPiosJ'K'PPVPT PTfjvpnQvwnp^nw^TipnfiP^ppzinTPRP^MfiK" 


74 0 


Query: 


537 


CLSKSQQCNGKDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDG 


596 






pt QK'^nnpMPK'nnprTn^c'np zxcipp \nj\AA r rpTK"HTYRPT.MriT,PT. c iKpTTJPFpnfiKFnP c ^nPr 




bD]Ct . 


/ 1 1 


pt QVQpiOPMPi^nnpf2"np-cir»T? iiQPP r r'\rKr\AA r pp r r'K''u r r i vppT mczt pt QK'^MPPpnp.K'PnPQnf^ 
l.xjo jxoyyurviorvxyxJL^vjxJooiJiijiaoL' t v v4 v v lLi i\ri I I KL*ijrJoxj^Jjor\oiNi^ji*^jj\ji\r4iJk-.o 


ft no 


Query: 


597 


SDEKDCDCGLRSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAH 


656 






QPiP K"T)PnPP.T R Q ITTDnL\P^A/"P.fZTnZ\nPP.PIJPWnV^T.HZlT.P^fiHTPf^Z\c;T.T ^ PWWT.V^ RZXH 




oD] Cc . 


QUI 


crip vnrnrcT PQPTDpiz\p\Rrp^ r rnz\npp.PTArp , OTPA/c;T uzit P^rzwTP^ziQT.TcipMWT.VQZiziw 


ft fin 


Query: 


657 


CYIDDRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLE 


716 






p vthfip^itp VQnPTTiwrziTrT p.t Hnn^npciiiPnA/nrPDT K'RTTc;HP^ , F , MnvTvnYnTi\T.T.P 
L/ii uui\\ji- Aiouriyn x /-vc xj^xjiiu^o^r\o>xir vj v ^£ii\r\Xjrvr\ j. lonrr e inxse x r ui ux,.tt_uxjrj 




Ck-i ^ 4- . 

bDj CI . 


OOl 


p vTriFiPPTTD vq n PT , piAr r rn it t pt "hfiocipip GnprvnrppT T^PTTQjjpTrpwT^PT'P'nY'nTZiT t p 


^? £. \J 


Query: 


717 


LE K PAE Y S SMVR P I C L PDAS HVF PAGKAI WVTGWGHTQ Y GGT GAL I — LQKGE I R V I NQT 


774 






T PWPZ\P VQCIMWD DT PT DHZiC; JJVPPZiP.t^Z\ T WVTP.Wf^WT'PiVP. + T.4-4- P Pi 




Sb j ct : 




xjfcjKPAfcj I boMV KJrlUijirUAJbriV E FAvjJvA.1 WV I (jWCjnlQ x <jAx AJinxjboxjrjKJ\EjlNJijvi-'-Liyij 


Qpn 


Query: 


775 


TCENLLPQQITPRMMCVGFLSGGVDSCQG 803 








C + P+ + + +C G 




Sbjct : 


981 


QCPHRTPRAFAAPSLRLSASRSLTPACHG 1009 





Score = 208 bits (530), Expect = 5e-52 

Identities = 99/99 (100%), Positives = 99/99 (100%) 

Query: 757 GTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADG 816 

GTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADG 
Sbjct: 1055 GTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADG 1114 

Query: 817 RIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 855 

RIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 
Sbjct: 1115 RIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 1153 



D > qi I 7363445 | ref | NP 035306.2| 111 suppression of tumorigenicity 14 (colon carcinoma 
gi 1 13529566 | gb | AAH05496. 1 1 E3 Suppression of tumorigenicity 14 (colon carcinoma) [M 
qi|7330638|qb|AAD02230.3| B epithin [Mus musculus] 

gi|13959712|sp|P56677|ST14 MOUSE El Suppressor of tumorigenicity 14 (Epithin) 

qi | 2 6342 937 | dbj | BAC35125 . 1 1 . El unnamed protein product [Mus musculus] 
Length = 855 

Score = 1438 bits (3722), Expect = 0.0 

Identities = 668/855 (78%), Positives = 738/855 (86%) 

Query: 1 MGS DRARXXXXXXXXXXXXXX YNSRHE KVNGLE E GVE FL PVNNVKKVE KHGPGRWXXXXX 60 

MGS+R R YNSR E +NG EEGVEFLP NN KKVEK GP RW 

Sbjct: 1 MGSNRGRKAGGGSQDFGAGLKYNSRLENMNGFEEGVEFLPANNAKKVEKRGPRRWWLVA 60 

Query: 61 XXXXXXXXXXXXXXXXWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 120 

WH YR+VRVQKVFNG++RITNE F+DAYENS STEF+SLAS+V 
Sbjct: 61 VLFSFLLLSLMAGLLVWHFHYRNVRVQKVFNGHLRITNEIFLDAYENSTSTEFISLASQV 120 
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Query: 121 KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXX 180 

K+ALKLLY+ VP LGPYHK+SAVTAFSEGSVIAYYWSEFSIP HL 
Sbjct: 121 KEALKLLYNEVPVLGPYHKKSAVTAFSEGSVIAYYWSEFSIPPHLAEEVDRAMAVERWT 180 

Query: 181 LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 240 

LPPRAR+LKSFV+TSWAFP D + +QRTQDNSCSF LHA G + RFTTPGFP+SPYPA 
Sbjct: 181 LPPRARALKSFVLTSWAFPIDPRMLQRTQDNSCSFALHAHGAAVTRFTTPGFPNSPYPA 240 

Query: 241 HARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPS 300 

HARCQW LRGDADSVLSLTFRSFD+A CDE GSDLVTVY++LSPMEPHA+V+LCGT+ PS 
Sbjct: 241 HARCQWVLRGDADSVLSLTFRSFDVAPCDEHGSDLVTVYDSLSPMEPHAWRLCGTFSPS 300 

Query: 301 YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 360 

YNLTF SSQNV L+TLITNT+RRHPGFEATFFQLP+MSSCGG L QGTF+SPYYPGHY 
Sbjct: 301 YNLTFLSSQNVFLVTLITNTDRRHPGFEATFFQLPKMSSCGGFLSDTQGTFSSPYYPGHY 360 

Query: 361 PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 420 

PPNI+CTWNI+VPNN++VKVRFK FYL++P VP G+C KDYVEINGEKYCGERSQFW+S 
Sbjct: 361 PPNINCTWNIKVPNNRNVKVRFKLFYLVDPNVPVGSCTKDYVEINGEKYCGERSQFVVSS 420 

Query: 421 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 480 

NS+KITV FHSD SYTDTGFLAEYLSYDS+DPCPG F C+TGRCIRKELRCDGWADC D+ 
Sbjct: 421 NSSKITVHFHSDHSYTDTGFLAEYLSYDSNDPCP04FMCKTGRCIRKELRCDGWADCPDY 480 

Query: 481 SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSK 540 

SDE C C+A HQFTCKN+FCKPLFWVCDSVNDCGD SDE+GCSCPA +F+CSNGKCL + 
Sbjct: 481 SDERYCRCNATHQFTCKNQFCKPLFWVCDSVNDCGDGSDEEGCSCPAGSFKCSNGKCLPQ 540 

Query: 541 SQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 600 

SQ+CNGKD+CGDGSDEASC VNW+CTK+TYRC NGLCLSKGNPECDGK DCSDGSDEK 
Sbjct: 541 SQKCNGKDNCGDGSDEASCDSVNWSCTKYTYRCQNGLCLSKGNPECDGKTDCSDGSDEK 600 

Query: 601 DCDCGLRS FTRQARWGGTDADE GEWPWQVS LHALGQGHI CGAS LI S PNWLVSAAHCY I D 660 

+CDCGLRSFT+QARWGGT+ADEGEWPWQVSLHALGQGH+CGASLISP+WLVSAAHC+ D 
Sbjct: 601 NCDCGLRS FTKQARWGGTNADE GEWPWQVS LHALGQGHLCGAS LI S PDWLVSAAHC FQD 660 

Query: 661 DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 720 

D+ F+YSD T WTAFLGL DQS+RSA GVQE +LKRII+HP FNDFTFDYDIALLELEK 
Sbjct: 661 DKNFKYSDYTMWTAFLGLLDQSKRSASGVQELKLKRIITHPSFNDFTFDYDIALLELEKS 720 

Query: 721 AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 780 

E Y S + +VR P I C L PDA+ HV F PAGKAI WVT GWGHT+ GGTGALILQKGEIRVINQTTCE+L+ 
Sbjct: 721 VEYSTWRPICLPDATHVFPAGKAIWVTGWGHTKEGGTGALILQKGEIRVINQTTCEDLM 780 

Query: 781 PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYT 840 

PQQITPRMMCVGFLSGGVDSCQGDSGGPLSS E DGR+FQAGWSWG+GCAQRNKPGVYT 
Sbjct: 781 PQQI T PRMMCVGFLS GGVDS CQGDSGGPLS SAEKDGRMFQAGWSWGE GCAQRNKPGVYT 840 

Query: 841 RLPLFRDWIKENTGV 855 

RLP+ RDWIKE+TGV 
Sbjct: 841 RLPWRDWIKEHTGV 855 



□ > qi| 16758444 | reflNP 446087.11 El suppression of tumorigenicity 14 [Rattus norvegi 
qi I 25527058 |pir | | JC7775 membrane type-serine protease 1 - rat 

qil25518264|pir| | JC7731 membrane-bound arginine-specific serine proteinase precur 
qi I 9650964 Idbj 1BAB03502.1I El membrane bound serine protease [Rattus norvegicus] 
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qi | 10336527 | dbj | BAB137 65 . 1 1 Esl membrane bound arginine specific serine protease [Ra 
norvegicus] 
Length = 855 

Score = 1433 bits (3710), Expect = 0.0 

Identities = 664/855 (77%) , Positives = 738/855 (86%) 

Query: 1 MGSDRARXXXXXXXXXXXXXXYNSRHEKVNGLEEGVEFLPVWVKKVEKHGPGRWXXXXX 60 

MG++R R YNSR E +NG EEGVEFLPVNN K+VEK GP RW 

Sbjct: 1 MGl^RGRKAGGGSQDFGAGLKYNSRLENMNGFEEGVEFLPVNNAKQVEKRGPRRWWMVA 60 

Query: 61 XXXXXXXXXXXXXXXXWHLQYRDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKV 120 

WH YR+VR+QKVFNG++RITNENF+DAYENS STEF+SLAS+V 
Sbjct: 61 WFSFLLLSLMAGLLVWHFHYRNVRIQKVFNGHLRITNENFLDAYENSTSTEFISLASQV 120 

Query: 121 KDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXX 180 

K+ALKL+YS VP LGPYHK+S VTAFSEGSVIAYYWSEFSI P HL 
Sbjct: 121 KEALKLMYSEVPVLGPYHKKSTVTAFSEGSVI AYYWSE FS I PPHLEEEVDRAMAVERWT 180 

Query: 181 LPPRARSLKSFWTSWAFPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPA 240 

LPPRAR+LKSFV+TSWAFP D + +QRTQDNSCSF LHARG + RFTTPGFP+SPYPA 
Sbjct: 181 LPPRARALKSFVLTSWAFPIDPRMLQRTQDNSCSFALHARGRTVTRFTTPGFPNSPYPA 240 

Query: 241 HARCQWALRGDADS VLS LT FRS FDLAS CDERGSDLVTVYNTLS PME PHALVQLCGT Y PPS 300 

HARCQW LRGDADSVLSLTFRSFD+A CD SDLVTVY++LSPMEPHA+V+LCGT+ PS 
Sbjct: 241 HARCQWVLRGDADSVLS LT FRS FDVAPCDGHDSDLVTVYDS LS PME PHAWRLCGT FS PS 300 

Query: 301 YNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHY 360 

YNLTF SSQNV L+TLITNT+RRHPGFEATFFQLP+MSSCGG L +AQGTF+SPYYPGHY 
Sbjct: 301 YNLTFLSSQNVFLVTLITNTDRRHPGFEATFFQLPKMSSCGGLLSEAQGTFSSPYYPGHY 360 

Query: 361 PPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTS 420 

PPNI +CTWNI +VPNN++VKVR FK FYL++P +P G+C KDYVE INGEK+CGERSQFW+S 
Sbjct: 361 PPNINCTWNIKVPNNRNVKVRFKLFYLVDPNIPVGSCTKDYVEINGEKFCGERSQFWSS 420 

Query: 421 NSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDH 480 

NS+KITV FHSD SYTDTGFLAEYLSYDS+DPCPG F C+TGRCIRK+LRCDGWADC D+ 
Sbjct: 421 NSSKITVHFHSDHSYTDTGFLAEYLSYDSNDPCPGMFMCKTGRCIRKDLRCDGWADCPDY 480 

Query: 481 SDELNCSCDAGHQFTCKNKFCKPLFWVCDSVTSTDCGDNSDEQGCSCPAQTFRCSNGKCLSK 540 

SDE +C C+A HQF CKN+FCKPLFWVCDSVNDCGD SDE+GCSCPA +F+CSNGKCL + 
Sbjct: 481 SDERHCRCNATHQFMCKNQFCKPLFWVCDSVNDCGDGSDEEGCSCPAGSFKCSNGKCLPQ 540 

Query: 541 SQQCNGKDDCGDGSDEASCPKVWVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK 600 

SQQCNGKDDCGDGSDEASC VN V+CTK+TYRC NGLCL+KGNPECDGK+DCSDGSDEK 
Sbjct: 541 SQQCNGKDDCGDGSDEASCDNVNAVSCTKYTYRCQNGLCLNKGNPECDGKKDCSDGSDEK 600 

Query: 601 DCDCGLRS FTRQARWGGTDADEGEWPWQVSLHALGQGHI CGASLIS PNWLVSAAHCY I D 660 

+CDCGLRSFT+QARWGGT+ADEGEWPWQVSLHALGQGH+CGASLISP+WLVSAAHC+ D 
Sbjct: 601 NCDCGLRS FTKQARWGGTNADE GEWPWQVS LHALGQGHLCGAS LI S PDWLVSAAHCFQD 660 

Query: 661 DRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKP 72 0 

+ F+YSD T WTAFLGL DQS+RSA GVQE +LKRII+HP FNDFTFDYDIALLELEKP 
Sbjct: 661 ET I FKYSDHTMWTAFLGLLDQSKRSASGVQEHKLKRIITH PS FNDFTFDYDIALLELEKP 720 

Query: 721 AEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLL 780 

AEYS++VRPICLPD +HVFPAGKAIWVTGWGHT+ GGT GAL I LQKGE I RVI NQTT CE LL 
Sbjct: 721 AEYSTWRPICLPDNTHVFPAGKAIWVTGWGHTKEGGTGALILQKGEIRVINQTTCEELL 780 

Query: 781 PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYT 840 
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PQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVE DGRI FQAGWSWG+GCAQRNKPGVYT 
Sbjct: 781 PQQI T PRMMCVGFLS GGVDS CQGDS GGPLS S VE KDGR I FQAGWS WGE GCAQRNKPGVYT 840 

Query: 841 RLPLFRDWIKENTGV 855 

R+P RDWIKE TGV 
Sbjct: 841 RIPEVRDWIKEQTGV 855 



□ > qi | 347 82939 [ qb | AAH05826 . 2 | ID ST14 protein [Homo sapiens] 
Length =526 

Score = 1145 bits (2962), Expect = 0.0 

Identities = 526/526 (100%), Positives = 526/526 (100%) 

Query: 330 TFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLE 389 

TFFQLPRMS SCGGRLRKAQGTFNS PY Y PGHY PPNI DCTWNI E VPNNQHVKVR FKFFY LLE 
Sbjct: 1 TFFQLPRMS SCGGRLRKAQGTFNS PYY PGHY PPNI DCTWNI EVPNNQHVKVRFKFFYLLE 60 

Query: 390 PGVPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDS 449 

PGVPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDS 
Sbjct : 61 PGVPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDS 120 

Query: 450 SDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCD 509 

SDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCD 
Sbjct: 121 SDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCD 180 

Query: 510 SVlsIDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNVVTCTK 569 

SVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVWVTCTK 
Sbjct: 181 SVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNWTCTK 240 

Query: 570 HTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARWGGTDADEGEWPWQ 629 

HTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARWGGTDADEGEWPWQ 
Sbjct: 241 HTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARWGGTDADEGEWPWQ 300 

Query: 630 VS LHALGQGHI CGAS LI S PNWLVSAAHCY I DDRGFR YSDPTQWTAFLGLHDQSQRSAPGV 689 

VSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWTAFLGLHDQSQRSAPGV 
Sbjct: 301 VSLHALGQGHI CGASLI S PNWLVSAAHCY IDDRGFRYSDPTQWTAFLGLHDQSQRSAPGV 360 

Query: 690 QERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTG 749 

QERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTG 
Sbjct: 361 QERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTG 420 

Query: 750 WGHTQ Y GGTGAL I LQKGE I RV I NQT TCENLLPQQ IT PRMMCVGFLS GGVDS CQGDSGG PL 809 

WGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQGDSGGPL 
Sbjct: 421 WGHTQYGGTGALI LQKGE I RVINQTTCENLLPQQIT PRMMCVGFLS GGVDSCQGDSGGPL 480 

Query: 810 SSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 855 

SSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 
Sbjct: 481 SSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 526 



□ > qi | 50760033 | ref |XP 417872.11 El PREDICTED: similar to suppression of tumorigenic 
of tumorigenicity 14 (colon carcinoma, matriptase, 
epithin) ; suppression of tumorigenicity 14 (colon 
carcinoma) ; suppression of tumorigenicity 14 (colon 
carcinoma matriptase epithin); membran. . . [Gallus gallus] 
Length = 1325 
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Score = 1095 bits (2832), Expect =0.0 

Identities = 504/833 (60%), Positives = 623/833 (74%), Gaps = 7/833 (0%) 



Query: 27 

Sbjct: 496 

Query: 87 

Sbjct: 556 

Query: 147 

Sbjct: 616 

Query: 205 

Sbjct: 675 



EKVNGLEEGVEFLPVNNVKKVEKHGPGRWXXXXXXXXXXXXXXXXXXXXXWHLQYRDVRV 86 
+ +N LEEGVEFLP N KK+EK GP R WH +YR+ V 

QDM^LEEGVEFLPAMNSKKMEKRGPKRRVWAILIIAFLLISLVTGLLVWHFKYRNAPV 555 

QKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESAVTAF 146 
QKVF G++R+ N F+DAYENS+S EF+ LA KVK ++ +Y P +GPYHKE+ +TAF 
QKVFTGHLRVLNREFIDAYENSSSPEFIMLAKKVKSTIEDIYRSQPDIGPYHKETVITAF 615 

SEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXX — L P PRARS LKS FWT S WAF PTDS K 204 
SEGSVIAYY SEF +P++ L PR R+ + V SWAFP D 

SEGSVIAYYLSEFIVPKYREEKLDRAMADKQSLVQRLNPRLRN-PTLKVESVVAFPADPS 674 

TVQRTQDNS CS FGLHARGVE LMRFTT PGFPDS PY PAHARCQWALRGDADS VLSLT FRS FD 264 

Q +DNSC F LHA+ E+ F TPGFP SPYP +ARC WALR DA+SV+SLTF++ D 
I AQTARDNSCLFALHAKEGE ITS FNT PGFPHS PY PNNARC YWALRADANSVI SLT FKTLD 734 



Query: 2 65 LASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLITNTERRH 324 

L C + SD V VY++LSP+EPHALV+LCG Y PSYNLTF SSQNV+L+ LITN E R+ 
Sbjct: 735 LEQCTD-SSDYVKVYDSLSPVEPHALVRLCGNYAPSYNLTFLSSQNVMLVALITNKEGRY 793 

Query: 325 PGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKF 384 

PGF+A FFQLP+M +CGG LR GTF +PYYP HYPP DC WNIEVP+ ++VKVRF 
Sbjct: 794 PGFKAE FFQLPKMKACGGTLRGE S GT FTT PY Y PAH YPPATDCVWNI E VPSTKNVKVRFNM 853 

Query: 385 FYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFIiAEY 444 

F++LEPGVP +C KDYV+IN +YCGERSQFW S++N+I V+FHSD+SYTDTGF AEY 
Sbjct: 854 FFVLEPGVPVSSCTKDYVQINSTRYCGERSQFWASSTNRIAVQFHSDRSYTDTGFSAEY 913 

Query: 445 LSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPL 504 

LSYDSSDPCPG+F C TGRCI K +RCDGW DC D SDE +C+C QF C+N +CKP 

Sbjct : 914 LSYDSSDPCPGKFACNTGRCIEKSMRCDGWLDCVDGSDERSCTC-TDQQFRCQNGWCKPK 972 

Query: 505 FWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKV — 562 

FWCD+VNDCGDNSDE CSC +F+C+NGKC+ +Q+C+GKDDCGDGSDE +C K 
Sbjct : 973 FWCDNWDCGDNSDELQCSCANNSFKCNNGKCIPNTQKCDGKDDCGDGSDEGTCSKAAQ 1032 

Query: 563 NWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARWGGTDAD 622 

+ V C +TY+C NGLC+SK NPECDG++DC D SDE +C+CG RS+ +++R+VGG ++D 
Sbjct: 1033 STVACKSYTYKCRNGLCISKQNPECDGQKDCEDNSDEDNCNCGTRSYIKKSRIVGGQNSD 1092 

Query: 623 EGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWTAFLGLHDQS 682 

GEWPWQVSLHA QGH+CGAS LI S WLVSAAHC+++ +G RYSD + WTA+LGL +QS 
Sbjct: 1093 VGEWPWQVSLHAKSQGHVCGASLISETWLVSAAHCFLELQGIRYSDASLWTAYLGLTNQS 1152 

Query: 683 QRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAG 742 

+R+ VQ +++KRIISH FND+T+DYDIA++EL+ P +S++V+PICLPD++H FP G 
Sbjct: 1153 KRNDANVQMKQIKRI I SHRS FNDYTYDYDIAVIELQS PVTFSAWQPI CLPDSTHNFPVG 1212 



Query: 743 KAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQ 8 02 

K +WVTGWG T GG+G+ ILQK EIRVINQT C LL Q+T RMMCVG L+GGVD+CQ 
Sbjct: 1213 KDLWVTGWGATVEGGSGSTILQKAEIRVINQTVCNRLLTDQLTERMMCVGVLTGGVDACQ 1272 

Query: 803 GDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 855 

GDSGGPL SVE GR+F AG W S WGD GC AQRNK P GV Y + R L R WIK+ TG+ 
Sbjct: 1273 GDSGGPLVSVENSGRMFLAGWSWGDGCAQRNKPGVYSRLTALRTWIKQQTGL 1325 



http://www.ncbi.nlm.nih.gov/BLAST/Blast.cgi 



1/13/05 



□ > gi I 49257232 | qb I AAH71077 . 1 1 0 Stl4-A-prov protein [Xenopus laevis] 
Length = 845 

Score = 1008 bits (2607), Expect =0.0 

Identities = 459/840 (54%), Positives = 603/840 (71%), Gaps = 10/840 (1%) 

Query: 22 YNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWXXXXXXXXXXXXXXXXXXXXXWHLQY 81 

YN+R + +NG EEGVEFLP N KKVEK GP + WH Y 

Sbjct: 10 YNNR PQS LNGFEE GVE FL PATNS KKVE KTGPKKKLAI FGLV I GAALLS LT I GLLVWH FAY 69 

Query: 82 RDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKES 141 

R + v K++ GY + I N F+DAYENS + EF L++KV D L+ +Y+G + PY ++ 
Sbjct: 70 RNKPVNKLYTGYLTIANTPFIDAYENSTTAEFSDLSAKVIDTLQTVYNGNKDIAPYLQKC 129 

Query: 142 AVTAFSEG SVIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWA 198 

+++AFSEG +VI YYWSEFS+P + PR R+ F + S+VA 
Sbjct: 130 SI SAFSEGGGNNVI GYYWSEFSVPAFREAAFEKAI SELKLPSVNPRQRT FALDSLVA 186 

Query: 199 FPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSL 258 

+PTD + + +++SC++ LH+ + +F++PGFPDSPYP +ARC W LR DA ++ L 

Sbjct: 187 YPTDPQIARVFKNSSCAYFLHSSNGWAKFSSPGFPDSPYPRNARCLWTLRADAGRIIHL 246 

Query: 259 TFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLIT 318 

F++F + C G D V VY++LSP+EP A ++LCG YPPSYNLTF SS NV+L+TL+T 
Sbjct: 247 HFKTFKMEKCKPNGGDFVMVYDSLSPIEPRAQIRLCGIYPPSYNLTFFSSSNVMLVTLVT 306 

Query: 319 NTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHV 378 

+ + PGF A F QLP+ S CGG +R A G SPY+P HYPP+ +C W+I+VP+N+ V 
Sbjct: 307 DNVGKFPGFLAEFKQLPKTSLCGGLIRDASGFITSPYFPAHYPPSTECIWDIQVPDNKFV 366 

Query: 379 KVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDT 438 

KVRF FYL EPGVP C KD+VEI G+KYCGE+ FW++NS+K++VRF SDQSYTDT 
Sbjct: 367 KVRFNMFYLAEPGVPVTKCTKDFVEIKGQKYCGEKEFFVVS^SSKMSV 426 

Query: 439 GFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKN 498 

GF AEYLSY+ +PCP QFTCR+GRCIR + +CDGW DC D SDE++C+C A QF C N 
Sbjct: 427 GFTAEYLSYEPRNPCPDQFTCRSGRCIRLDQKCDGWNDCEDFSDEMSCTCTA-LQFRCVN 485 

Query: 499 -KFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEA 557 

K CKP +++CD VNDCGD+SDE C CP TF+C NGKC+ SQ+C+ D+CGDGSDEA 
Sbjct: 486 SKLCKPSYFICDGVNDCGDSSDELACKCPNNTFKCGNGKCIPDSQKCDRVDNCGDGSDEA 545 

Query: 558 SCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK — DCDCGLRSFTRQARV 615 

C +V CT++TY+C N C++K NPECDG+ DCSDGSDE C+CG R FT+++R+ 

Sbjct: 546 ECDQVLTTACTEYTYKCKNNQCITKKNPECDGENDCSDGSDENAAKCNCGKRPFTKKSRI 605 

Query: 616 VGGTDADE GEWPWQVS LHALGQGH I CGAS LI S PNWLVS AAHC Y I DDRGFR Y S D PTQWT AF 675 

VGG +AD GE+PWQVSLHA G H CGASL+SP L+SAAHC+ DD RYSD + WTA+ 
Sbjct: 606 VGGVNADT GE FPWQVS LHAKGNKHT CGAS LVS PTML I S AAHC FQDDHQMR Y S DAS LWT AY 665 

Query: 676 LGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDA 735 

LGLHDQ+Q + V ER++KRI++H FND T+D DIA+LELEKP EY+ ++.P+C+P++ 
Sbjct: 666 LGLHDQAQLNTKDWERKIKRIMAHIGFNDNTYDNDIAVLELEKPVEYTDFIQPVCIPES 725 

Query: 736 SHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLS 795 

+H FP GK IWVTGWG + GG A+ILQK EIR+INQT C LL Q+TPRM+C GF+S 
Sbjct: 726 THDFPVGKPIWVTGWGALKEGGGAAVILQKAEIRIINQTECNKLLDGQLTPRMLCAGFVS 785 
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Query: 796 GGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 855 

GG+D+CQGDSGGPLSSVE + +++ AGWSWG+GCA+RNKPGVYT++ + RDWIK+ TG+ 
Sbjct: 786 GGIDACQGDSGGPLSSVELNNKVYLAGWSWGEGCARRNKPGVYTKVSMMRDWIKDKTGL 845 



□ > qi I 9757702 | dbi I BAB08218 . 1 1 B homolog of human MT-SP1 [Xenopus laevis] 
Length = 845 

Score = 998 bits (2581), Expect =0.0 

Identities = 457/840 (54%), Positives = 599/840 (71%), Gaps = 10/840 (1%) 

Query: 22 YNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWXXXXXXXXXXXXXXXXXXXXXWHLQY 81 

YN+R + +NG EEGVEFLP N KKVEK GP + WH Y . 

Sbjct: 10 YNNRPQSLNGFEEGVEFLPATNSKKVEKTGPKKKLAIFGLVIGAALLSLTIGLLVWHFAY 69 

Query: 82 RDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKES 141 

R+ V K++ GY+ I N F+DAYENS + EF L++KV D L+ +Y+G + PY ++ 
Sbjct: 70 RNKPVNKLYTGYLTIANTPFIDAYENSTTAEFSDLSAKVIDTLQTVYNGNKDIAPYLQKC 129 

Query: 142 AVTAFSEG SVIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWA 198 

+++AFSEG +VI YYWSEFS+P + PR R+ F + S+VA 
Sbjct: 130 S I SAFSEGGGNNVI GY YWSEFSVPAFREAAFEKAI SELKLPSVNPRQRT FALDSLVA 186 

Query: 199 FPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSL 258 

+PTD + + +++SC++ LH+ + +F++PGFPDSPYP +ARC W LR DA ++ L 
Sbjct: 187 Y PTDPQI ARVFKNS SCAY FLHS SNGWAKFS S PGFPDS PY PRNARCLWTLRADAGRI I HL 246 

Query: 259 TFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLIT 318 

F ++ F + c G D V VY++LSP+EP A ++LCG YPPSYNLTF SS NV+L+TL+T 
Sbjct: 247 HFKTFKMEKCKPNGGDFVMVYDSLSPIEPRAQIRLCGIYPPSYNLTFFSSSKVMLVTLVT 306 

Query: 319 NTERRHPGFEAT FFQLPRMS SCGGRLRKAQGT FNS PY Y PGH Y PPNI DCTWNI EVPNNQHV 378 

+ + PGF A F QLP+ S CGG +R A G SPY+P HYPP+ + W+I+VP+N+ V 
Sbjct: 307 DNVGKFPGFLAE FKQLPKTSLCGGLIRDASGFITS PYFPAHY PPSTES IWDIQVPDNKFV 366 

Query: 37 9 KVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDT 438 

KVRF FYL EPGVP C KD+VEI G+KYCGE+ FW++NS+K++VRF SDQSYTDT 
Sbjct: 367 KVRFNMFYLAEPGVPVTKCTKDFVEIKGQKYCGEKEFFVVSNNSSKMSVRFVSDQSYTDT 426 

Query: 439 GFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKN 498 

GF AEYLSY+ +PCP QFTCR+GRCIR + +CDGW DC D SDE++C+C A QF C N 
Sbjct: 427 GFTAEYLSYEPRNPCPDQFTCRSGRCIRLDQKCDGWNDCEDFSDEMSCTCTA-LQFRCVN 485 

Query: 499 -KFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEA 557 

K CKP +++CD VNDCGD+SDE C CP TF+C NGKC+ SQ+C+ D+CGDGSDEA 
Sbjct: 486 S KLCKPS Y FI CDGVNDCGDS S DE LACKC PNNT FKCGNGKC I PDSQKCDRVDNCGDGS DE A 545 

Query: 558 SCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEK — DCDCGLRSFTRQARV 615 

C +V CT++TY+C N C++K NPECDG+ DCSDGSDE C+CG R FT+++R+ 

Sbjct: 546 ECDQVLTTACTEYTYKCKNNQCITKKNPECDGENDCSDGSDENAAKCNCGKRPFTKKSRI 605 

Query: 616 VGGTDADE GEWPWQVS LHALGQGH I CGAS LIS PNWLVS AAHC Y I DDRGFRY S DPTQWTAF 675 

VGG +AD GE+PWQVSLHA G H CGASL P L+SAAHC+ DD RYSD + WTA+ 
Sbjct: 606 VGGVNADTGEFPWQVSLHAKGNKHTCGASLGFPTMLISAAHCFQDDHQMRYSDASLWTAY 665 

Query: 676 LGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDA 735 

LGLHDQ+Q + V ERR+KRI++H FND T+D DIA+LELEKP EY+ ++P+C+P++ 
Sbjct: 666 LGLHDQAQLNTKDWERRIKRIMAHIGFNDNTYDNDIAVLELEKPVEYTDFIQPVCIPES 725 
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Query: 736 SHVFPAGKAIWVTGWGHTQYGGTGALI LQKGE I RVINQTTCENLLPQQITPRMMCVGFLS 795 

+H FP GK IWVTGWG + GG A+ILQK EIR+INQT C LL Q+TPRM+C GF+S 
Sbjct: 726 THDFPVGKPIWVTGWGALKEGGGAAVILQKAEIRIINQTECNKLLDGQLTPRMLCAGFVS 785 

Query: 796 GGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 855 

GG+D+CQGDSGGPLSSVE + +++ AGWSWG+GCA+RNKPGVYT++ + RDW K+ TG+ 
Sbjct: 786 GGI DACQGDS GGPLS S VE LNNKVYLAGVVSWGEGCARRNKPGVYTKVSMMRDWSKDKTGL 845 



□ > qi | 52354617 | qb 1 AAH82854 . 1 1 Unknown (protein for MGC: 81690) [Xenopus laevis] 
Length = 845 

Score = 993 bits (2566), Expect =0.0 

Identities = 450/840 (53%), Positives = 592/840 (70%), Gaps = 10/840 (1%) 

Query: 22 YNSRHEKWGLEEGVEFLPVNNVKKVEKHGPGRWXXXXXXXXXXXXXXXXXXXXXWHLQY 81 

YN++ + +NG EEGVEFLP N KKVEK GP + WH Y 

Sbjct: 10 YNNKPQSLNGFEEGVEFLPATNSKKVEKTGPKKKLAIFGWIGAALISLTIGLLVWHFAY 69 

Query: 82 RDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKES 141 

R+ + K++ GY+ I N F++AYENS + EF L++KV + L+ +Y+G + PY ++ 
Sbjct: 70 RNAPIHKLYTGYLTIANTQFIEAYENSTTPEFADLSAKVINTLQTVYNGNKDIAPYLQQC 129 

Query: 142 AVTAFSEGS VIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWA 198 

+++AFSEG+ VI YYWSEFS+P + PR R+ F V S+VA 
Sbjct: 130 S I SAFSEGNGNNVI GYYWSEFSVPAFREAAFEKAI SE FKLPSVDPRRRT FAVDSIVA 186 

Query: 199 FPTDSKTVQRTQDNS CS FGLHARGVE LMRFTT PGFPDS PY PAHARCQWALRGDADS VLS L 258 

+ TD +T + +++SC++ LH+ + +F++PGFPDSPYP +ARC W LR +A ++ L 

Sbjct: 187 YSTDPQTTRIFRNSSCAYFLHSSDGAVAKFSS PGFPDS PYPRNARCLWTLRANAGQIIHL 246 

Query: 259 TFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLIT 318 

F++F + C G D V VY++LSPMEP A ++LCG YPPSYNLTF SS NV+L+TL+T 
Sbjct: 247 HFKTFKMEKCKINGGDHVMVYDSLSPMEPRAQIRLCGIYPPSYNLTFFSSSKVMLVTLVT 306 

Query: 319 NTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHV 378 

+ + PGF A F QLP+ S CGG +R A G SPY+P +YPP +C W+I+VP + V 
Sbjct: 307 DNVGKFPGFLAEFRQLPKKSLCGGHIRDATGVITSPYFPAYYPPKTECIWDIQVPGKKFV 366 

Query: 379 KVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDT 438 

K+RF FYL EPGVP C KD+VEI G+KYCGER FW++NS+KI+VRF SDQSY DT 
Sbjct: 367 KLRFNMFYLAEPGVPVTKCTKDFVEIEGQKYCGERKVFVVSNNSSKISVRFVSDQSYADT 426 

Query: ' 439 GFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKN 498 

GF A+YLSY+ +PCP QFTC+TGRCIR + +CDGW DC D SDE C+C A QF C + 
Sbjct: 427 GFTADYLSYEPRNPCPDQFTCKTGRCIRLDQKCDGWNDCEDFSDEKKCTCTA-QQFRCTD 485 

Query: 499 -KFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEA 557 

K CKP +VCD VNDCGDNSDE C CP TF+C NGKC SQ+C+ D+CGDGSDEA 
Sbjct: 486 SKLCKPSHFVCDGVNDCGDNSDELSCKCPNSTFKCGNGKCFPDSQKCDRTDNCGDGSDEA 545 

Query: 558 SCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKD — CDCGLRS FTRQARV 615 

C +V CT++TY+C N C++K NPECDG+ DC DGSDE C+CG R FT+++R+ 

Sbjct: 546 DCGQVRTTVCTEYTYKCKNNQCITKKNPECDGENDCIDGSDENSAKCNCGKRPFTKKSRI 605 

Query: 616 VGGTDADEGEWPWQVSLHALGQGHICG7\SLISPNWLVSAAHCYIDDRGFRYSDPTQWTAF 675 

4-GG +AD GE+PWQVSLH G H CGASL SP L+SAAHC+ DD RYSD + WTA+ 
Sbjct: 606 I GGVNADLGE F PWQVS LHVKGS KHT CGAS LAS PTT L I S AAHC FQDDHSMR Y S DAS LWTAY 665 
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Query: 67 6 LGLHDQSQRSAPGVQERRLKRI I SHPFFNDFTFDYDIALLELEKPAE YSSMVRPI CLPDA 735 

LGLHDQ+Q + V ER++KRI++H FND T+D DIA+LELEKP +Y+ ++PIC+P++ 
Sbjct: 666 LGLHDQAQLNTKNWERKIKRIMAHIGFNDNTYDNDIAVLELEKPVDYTDFIQPICIPES 725 

Query: 736 SHVFPAGKAIWVTGWGHTQYGGTGALI LQKGE I RVI NQTTCENLLPQQI TPRMMCVGFLS 795 

+H FP GK IWVTGWG + GG A+ILQK EIRVINQT C LL Q+TPRM+C GF+S 
Sbjct: 726 THDFPVGKPIWVTGWGALKEGGGAAVILQKAEIRVINQTECNKLLDGQLTPRMLCAGFVS 785 

Query: 796 GGVDSCQGDSGGPLSSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 855 

GG+D+CQGDSGGPLSSV+ + +++ AG+VSWG+GCA+RNKPGVYT++ + RDWIK+ TG+ 
Sbjct: 786 GGI DACQGDS GGPLS S VDLNNKVYLAGI VSWGE GCARRNKPGVYTKVSMMRDWI KDKTGL 845 



□ > qi | 17390323 | qb | AAH18 146.1 1 B ST14 protein [Homo sapiens] 
Length = 422 

Score = 920 bits (2379), Expect =0.0 

Identities = 422/422 (100%), Positives = 422/422 (100%) 

Query: 434 SYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQ 493 

SYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQ 
Sb j ct : 1 SYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQ 60 

Query: 494 FTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDG 553 

FTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDG 
Sbjct : 61 FTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDG 120 

Query: 554 SDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQA 613 

SDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQA 
Sbjct: 121 SDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQA 180 

Query: 614 RWGGTDADEGEWPWQVSLHALGQGHI CGASLI S PNWLVSAAHCY IDDRGFRYSDPTQWT 67 3 

RWGGT DADE GEWPWQVS LHALGQGH I CGAS LIS PNWLVSAAHCY IDDRGFRYS DPTQWT 
Sbjct: 181 RWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWT 240 

Query: 674 AFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLP 733 

AFLGLHDQSQRSAPGVQERRLKRI I SHPFFNDFTFDYDIALLELEKPAE YSSMVRPI CLP 
Sbjct: 241 AFLGLHDQSQRSAPGVQERRLKRI I SHPFFNDFTFDYDIALLELEKPAE YSSMVRPI CLP 300 

Query: 734 DASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGF 793 

DASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGF 
Sbjct: 301 DASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGF 360 

Query: 794 LSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENT 853 

LSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENT 
Sbjct: 361 LSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENT 420 

Query: 854 GV 855 
GV 

Sbjct: 421 GV 422 



Score = 38.5 bits (88), Expect = 0.87 

Identities = 25/95 (26%), Positives = 34/95 (35%), Gaps = 26/95 (27%) 

Query: 395 GTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCP 454 

G C + NG+ CG+ S N +T H+ 
Sbjct: 102 GKCLSKSQQCNGKDDCGDGSDEASCPKVNWTCTKHT 138 
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Query: 455 GQFTCRTGRCI RK- E LRCDGWADCTDHSDE LNCSC 488 

+ C G C+ K CDG DC+D SDE +C C 
Sbjct: 139 --YRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDC 171 



□ > qi | 47225569 | emb | CAG12052 . 1 1 unnamed protein product [Tetraodon nigroviridis ] 
Length = 730 

Score = 740 bits (1911), Expect =0.0 

Identities = 363/741 (48%), Positives = 471/741 (63%), Gaps = 16/741 (2%) 

Query: 120 VKDALKLLYSGVPFLGPYHKESAVTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXX 179 

V L Y FL YH +S VTAFSEG VIAYYWS+F IP H 

Sbjct: 1 VSSQLGTTYKNDTFLSKYHTKS WTAFSEG-VI AY YWSQFDI PVHDLE I LPI FSEERVLS 59 

Query: 180 XLPP RARS LKS FWTS WAFPTDSKTVQRTQDNSCS FGLHARGVE LMR FTT PGFPDS 236 

L RRS+V V VT+CFLA EM F++PG+P 

Sbjct: 60 VLENNIVRTRSTQGSVRIREVTASCRDIWVAATGQDDCFFRLEAEEQEKM-FSSPGYPVK 118 

Query: 237 PYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGT 296 

YP +RCQW +R ++ +S++F F + D+ D V++Y++LSP + A+ + CG 
Sbjct: 119 -YPPRSRCQWQIRASEENAISVSFPFFHIE — DDCSDDFVSIYDSLSPDDSQAITEKCGQ 175 

Query: 297 YPPSYNLTFHSSQNVLLITLITNTERRHPGFEATFFQLPRMSS — CGGRLRKAQGTFNSP 354 

PPS L SS N++LI LIT++E + PGF A + +P+ ++ CGG L G SP 
Sbjct: 176 RPPSNPLEVTSSGNIMLINLITDSEVQQPGFLARYSAIPKSNATTCGGVLTADTGVITSP 235 

Query: 355 YYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERS 414 

YP YPP +DC W I+VP ++V+++F F + EPGV C KD+VE+ G KYCGE 
Sbjct: 236 LY PS S Y PPAVDCKWT I KVPAGRNVRI KFTLFRMKE PGVDTRVCHKDFVEVMGT KYCGE I P 295 

Query: 415 QFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGW 474 

F +T+N N + V+FHSD SYTD GF AEY ++D S+PCP +F C +GRCI KELRCDGW 
Sbjct: 296 YFTLTTNENVLDVKFHSDGSYTDKGFSAEYSAFDPSNPCPNKFACNSGRCISKELRCDGW 355 

Query: 475 ADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSN 534 

DC D SDE+ C C+ QF CKN CKP WVCD VNDCGD SDE CSC FRCS+ 
Sbjct: 356 NDCGDMSDEMMCQCEKD-QFACKNGLCKPNLWVCDRVNDCGDWSDEAKCSCEKNEFRCSS 414 

Query: 535 GKCLSKSQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCS 594 

G CL + CN K DC DGSDEA+C + + TC++ TY C N +C++K N ECD DCS 
Sbjct: 415 GLCLPQDWCNQKRDCVDGSDEANC-ETSKGTCSEFTYMCKNQVCINKLNAECDRVNDCS 473 

Query: 595 DGSDEKDCDCGLRS FTRQARWGGTDADEGEWPWQVSLHALGQGHI CGASLI SPNWLVSA 654 

D SDE C CG R + + R+VGG +A+ GEWPWQVSLH L GH+CGAS+IS WL+SA 
Sbjct: 474 DSSDEAACGCGTRPY-KLNRIVGGQNAEVGEWPWQVSLHFLTYGHVCGASIISERWLLSA 532 

Query: 655 AHCYIDDRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIAL 714 

AHC++ + W + G+ DQ ++ G+ R LKRIISHP +N T+DYDIAL 

Sbjct: 533 AHCFVTSSPQNHI-AANWLTYSGMQDQYKQD — GILRRPLKRIISHPDYNQMTYDYDIAL 589 

Query: 715 LE LE K PAE Y S SMVR P I C L PDAS HV F P AGKAI WVT GW GHTQ Y GGT GAL I LQKGE I R V I NQT 774 

LEL +P E+++ ++PICLPD+SH+FPAG + WVTGWG + GG A +LQK +++IN T 
Sbjct: 590 LELSEPLEFTNTIQPICLPDSSHMFPAGMSCWVTGWGAMREGGQKAQLLQKT^SVKIINGT 649 

Query: 775 T CENLL PQQ I T PRMMCVGFLS GGVDS CQGDS GGPLS S VE ADGR I FQAGWSWGDGCAQRN 834 

C + Q+T RM+C GFL+GGVD+CQGDSGGPL E G+ FQAG+VSWG+GCA+RN 
Sbjct: 650 VCNEVTEGQVTSRMLCSGFLAGGVDACQGDSGGPLVCFEESGKWFQAGIVSWGEGCARRN 709 
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Query: 835 KPGVYTRLPLFRDWIKENTGV 855 

KPG+YTR+ R WIKE G+ 
Sbjct: 710 KPGI YTRVTKLRKWI KEQI GI 730 



□ > qi| 10179932 |qb I AAG13949.il El serine protease SNC19 [Homo sapiens] 
Length = 325 

Score = 727 bits (1876), Expect =0.0 

Identities = 325/325 (100%), Positives = 325/325 (100%) 

Query: 340 CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPK 399 

CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPK 
Sbjct: 1 CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPK 60 

Query: 400 DYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTC 459 

DYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTC 
Sbjct: 61 DYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTC 120 

Query: 460 RTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSD 519 

RTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSD 
Sbjct: 121 RTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSD 180 

Query: 52 0 EQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLC 57 9 

EQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLC 
Sbjct: 181 EQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLC 240 

Query: 580 LSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARWGGTDADEGEWPWQVSLHALGQGH t 639 

LSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARWGGTDADEGEWPWQVSLHALGQGH 
Sbjct: 241 LSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARVVGGTDADEGEWPWQVSLHALGQGH 300 

Query: 640 I CGAS LIS PNWLVSAAHC Y I DDRGF 664 

ICGASLISPNWLVSAAHCYIDDRGF 
Sbjct: 301 ICGASLISPNWLVSAAHCYIDDRGF 325 



□ > qi 1 47227882 | emb | CAG09045 . 1 1 unnamed protein product [Tetraodon nigroviridis] 
Length = 910 

Score = 702 bits (1811), Expect =0.0 

Identities = 363/827 (43%), Positives = 482/827 (58%), Gaps = 61/827 (7%) 

Query: 33 EEGVEFLPVNNVKKVEKH-GPGRWXXXXXXXXXXXXXXXXXXXXXWHLQYR-DVRVQKVF 90 

E V+FLP ++ K+EK GPG+ WH ++R D+RV+KV+ 

Sbjct: 1 ESSVQFLPASDNTKLEKKKGPGKTGAVIGWILAVWALMIGLLVWHFRFRQDIRVKKVY 60 

Query: 91 NGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESAVTAFSEGS 150 

G MRITN+ F +AYE+SN+TEF +LA +V LK +YS PL Y+ S V AFSEGS 
Sbjct: 61 TGSMRITNQVFENAYEDSNTTEFKALAKQWTQLKTIYSKSPQLAKYYVGSTVQAFSEGS 120 

Query: 151 VIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRA RSLKS FWTSWAFPTD 202 

V+AYY SEF +P + R+ S + +W+ D 

Sbjct: 121 WAYYLSEFRVPVGQEAAVDKAMAAMEQLWKEQRSVYRAGNSLMFDNWSSALDTRMTS 180 

Query: 203 SKTVQRTQDNSCS FGLHARGVE LMRFTT PGFPDS PY PAHARCQWALRGDA 252 

S + + S F HA+ + + +PGFP+ PY + QW LR D 
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Sbjct: 181 ASFSRSPELNSWIVDSASGSLQFSEHAKPNYIGQIQSPGFPNHPYSPNTLVQWRLRADP 240 



Query: 253 DSVLS LT FRS FDLAS CDERGSDLVTVYNTLS PME PHALVQLCGT Y PPS YNLT FHSSQNVL 312 

D V+ L F + +L + +D + +Y++L +E + +LCG + PS +TF SS NVL 

Sbjct: 241 DYVIQLKFDTINLEN — NCTNDFIRIYDSLVSIESRIMDELCGYHSPSEPMTFISSGNVL 298 

Query: 313 LITLITNTERRHPGFEATFFQLPRMSS CGGRLRKAQGT FNS P Y Y PGH Y P PNI DCTWN 369 

L+ + TN + +PGF A Q+ R S CGG+L G F SP +P +YP I C W 

Sbjct: 299 LVAMATNDMKNY PGFRAQVSQVKRGS PAT VCGGKLS GENGK FT S PN F PNY Y PAR I S CQWT 358 

Query: 370 IEVPNNQHVKVRFKFFYLLEPGVP-AGTCPKDYVEINGEKYCGERSQFVVT — SNSNKIT 426 

I+VP + VKV+F+ F L EPG CPKDYV+ING+K CGE + + VT S++N + 

Sbjct: 359 IQVPAGKVVKVKFRKFLLFEPGQERVKNCPKDYVQINGKKVCGEQPDWAVTETSSTNTMD 418 

Query: 427 VRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWJ\DCTDHSDELNC 486 

V F SD S+ D GF AE+ + D+ + CP +F CR RCI+ EL+CDGW DC D SDE+NC 
Sbjct: 419 VLFVSDSSHVDRGFEAEFQAVDADNLCPNKFQCRNQRCIKSELQCDGWNDCGDMSDEVNC 478 / 

Query: 487 SCDAGH QFTCKNKFCKPLFWVCDSVNDCGD 516 

S H TCKN CKP+FW CD V+DCGD 

Sbjct: 479 SERRSHTRPIVRNPSKMKHSFKIHSPKFCSPECNSKDITCKNGLCKPMFWKCDGVDDCGD 538 

Query: 517 NSDEQGCS-CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCL 575 

+DEQ C CP F+C N KC+S+ QC+ +DDCGDGSDE +C + CT TYRC 

Sbjct: 539 KTDEQNCGDCPTGQFKCQNKKCISEKNQCDSRDDCGDGSDEINCGRNTDAKCTDLTYRCS 598 

Query: 57 6 NGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARWGGTDADEGEWPWQVSLHAL 635 

N C++K NPECDG DC DGSDE +C CG F R +R+VGG ADE GE + PWQVS LH 
Sbjct: 599 NNKCITKVNPECDGTPDCEDGSDEVNCGCGKNVF-RTSRIVGGEVADEGEFPWQVSLHIK 657 

Query: 636 GQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLK 695 

+GH+CGAS+I S PNWLV+AAHC D+ R S P W A+LGLH Q Q V R LK 

Sbjct: 658 NRGHVCGASI I S PNWLVTAAHCVQDEGTLRLSQPGSWEAYLGLHVQ-QNIKKSVWRNLK 716 

Query: 696 RIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQY 755 

RII HP +N++T+D D+AL+EL+ P YS ++PICLP H FP G+ +W+TGWG T+ 
Sbjct: 717 RIIPHPNYNEYTYDNDVALMELDSPVTYSDYIQPICLPAPQHDFPVGETVWITGWGATRE 776 

Query: 756 GGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQ 802 

G A +LQK ++R+INQ TC +L+ QIT RM+C G L+GGVD+CQ 
Sbjct: 777 E GPAATVLQKAQVR 1 1 NQDT CNS LMGGQ ITS RML CAGVLT GGVD ACQ 823 



□ > qi | 49900216 | qh | AAH76994 . 1 1 K3 MGC89623 protein [Xenopus tropicalis] 

qi I 52346064 [ref | NP 001005075.11 13 MGC89623 protein [Xenopus tropicalis] 
Length = 663 

Score = 690 bits (1781), Expect = 0.0 

Identities = 317/609 (52%), Positives = 417/609 (68%), Gaps = 10/609 (1%) 

Query: 22 YNSRHEKVNGLEEGVEFLPVNNVKKVEKHGPGRWXXXXXXXXXXXXXXXXXXXXXWHLQY 81 

YN+R + +NG EEGVEFLP N KKVEK GP + WH Y 

Sbjct: 10 YNNRPQSMNGFEEGVEFLPAANTKKVEKAGPKKKLAIFGWIGAALLSLTIGLLVWHFAY 69 

Query: 82 RDVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKES 141 

R+ VQK++ GY+RI N FV+AYENS + EF L+ KV L+ LY+G + PY ++ 
Sbjct : 70 RNAPVQKLYTGYLRIANTQFVEAYENSTTREFADLSVKVISTLRTLYNGEKDIAPYLQQC 129 
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Query: 142 AVTAFSEGS VIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWA 198 

+++AFSEGS V+ YYWSEFS+P LP ++F V S+VA 

Sbjct: 130 SISAFSEGSDNNWGYYWSEFSVP AFREEAFEKAI SE LKLPTVNLRQRAFAVDS LVA 186 

Query: 199 FPTDSKTVQRTQDNSCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSL 258 

+PTD + + +++SC+F LH+ + +F++PGFPD+PYP +ARC W LR DA ++ L 
Sbjct: 187 YPTDPQIARNFKNSSCAFFLHSSAGVMTKFSSPGFPDTPYPPNARCLWTLRADAGQMIRL 246 

Query: 259 TFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLIT 318 

F++F + C D V VY++LSP+EP A ++LCG YPPSYNLTF SS NV+L+TL+T 

Sbjct: 247 KFKTFKMEKCKANAGDFVMVYDSLSPIEPRAQIRLCGIYPPSYNLTFFSSSNVMLVTLVT 306 

Query: 319 NTERRHPGFEATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHV 378 

+ + PGF A F Q P+ S CGG +R A G F SPY+PGHYPP I+C W+I+VP+N+ V 
Sbjct: 307 DNVGKFPGFLAEFSQFPKTSLCGGYIRDASGVFTSPYFPGHYPPKIECIWDIQVPDNKFV 366 

Query: 379 KVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDT 438 

K+RF FYL EPGVP C KD+VE ING+KYCGER FW++NS+K++VRF SDQSYTDT 
Sbjct: 367 KLRFNMFYLAEPGVPVTKCTKDFVEINGQKYCGERKFFWSNNSSKMSVRFVSDQSYTDT 426 

Query: 439 GFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKN 498 

GF AEYLSY+ +PCP QF C++GRCIR + +CDGW DC D SDE +C+C A QF C N 
Sbjct: 427 GFTAEYLSYEPRNPCPDQFACKSGRCIRLDQKCDGWNDCEDFSDEKSCTCTA-LQFRCTN 485 

Query: 499 -KFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEA 557 

K CKP ++VCD VNDCGD+SDE C CP T++C NGKC+ +SQ+C+ D+CGDGSDEA 
Sbjct: 486 SKLCKPSYFVCDGVNDCGDSSDELACQCPNNTYKCGNGKCIPESQKCDRTDNCGDGSDEA 545 

Query: 558 SCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDE — KDCDCGLRS FTRQARV 615 

C +V TCT++TY+C N C++K NPECDG+ DCSDGSDE C+CG R FT+++R+ 

Sbjct: 546 ECGRVLTTTCTEYTYKCKNNQCITKKNPECDGENDCSDGSDEISAKCNCGKRPFTKKSRI 605 



Query: 616 VGGTDADEG 624 

VGG +AD G 
Sbjct: 606 VGGVNADSG 614 



Score = 87.0 bits (214), Expect = 2e-15 

Identities = 37/60 (61%), Positives = 51/60 (85%), Gaps = 3/60 (5%) 

Query: 796 GGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 855 

GGV++ DSGGPLSSVE + +++ AG+VSWG+GCA+RNKPGVYTR+ + RDWI++ TG+ 
Sbjct: 607 GGVNA DSGGPLSSVELNNKVYLAGIVSWGEGCARRNKPGVYTRVAMMRDWIRDKTGL 663 



□> qi| 18655574 | pdb | 1SAX|A B Chain A, Crystal Structure Of Mtspl (Matriptase) 

qi | 18655572 | pdb | 1EAW| C § Chain C, Crystal Structure Of The Mtspl (Matriptase) -Bpt 
(Aprotinin) Complex 

qij 18655570[pdb| 1EAW1A II Chain A, Crystal Structure Of The Mtspl (Matriptase) -Bpt 
(Aprotinin) Complex 
Length = 241 

Score = 512 bits (1319), Expect = e-143 

Identities = 241/241 (100%), Positives = 241/241 (100%) 

Query: 615 WGGTDADEGEWPWQVS LHALGQGH I CGAS LI S PNWLVSAAHC Y I DDRGFR YSDPTQWTA 674 
WGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRYSDPTQWTA 
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Sbjct: 1 WGGTDADEGEWPWQVS LHALGQGHI CGAS LI S PNWLVSAAHC Y I DDRGFRYSDPTQWTA 60 

Query: 675 FLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPD 734 

FLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPD 
Sbjct: 61 FLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPD 120 

Query: 735 ASHVFPAGKAJWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFL 794 

ASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFL 
Sbjct: 121 ASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFL 180 

Query: 795 SGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTG 854 

SGGVDSCQGDSGGPLSSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKENTG 
Sbjct: 181 SGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTG 240 

Query: 855 V 855 
V 

Sbjct: 241 V 241 



□ > qi | 47209610 | emb | CAF89594 . 1 1 unnamed protein product [Tetraodon nigroviridis] 
Length = 435 

Score = 453 bits (1166), Expect = e-126 

Identities = 210/424 (49%), Positives = 269/424 (63%), Gaps = 28/424 (6%) 

Query: 453 CPGQFTCRTGRCIRKELRCDGWADCTDHSDELNC SCD 489 

CPG F C C+ LRCDGW DC D+SDE +C CD 
Sbjct: 14 CPGSFWCHNNLCLNPALRCDGWDDCGDNSDERDCRESTPALAPVTTCTDGAFLFLSAECD 73 

Query: 490 AGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGC-SCPAQTFRCSNGKCLSKSQQCNGKD 548 

A Q C+N CKP FW CD +DCGDNSDE C C F C N +C+ +S++C+G+D 

Sbjct: 74 AS-QLRCQNGRCKPKFWQCDGTDDCGDNSDEDNCVKCKPGEFLCRNQRCVPESRRCDGRD 132 

Query: 549 DCGDGSDEASCPK-VNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLR 607 

DC DGSDE+ C + V + C++H++RC NG C+SK NP+CDG+ DC D SDE C CG R 
Sbjct: 133 DCSDGSDESQCKRSVLLQQCSEHSFRCRNGKCISKLNPDCDGELDCEDASDEDGCHCGKR 192 

Query: 608 S FTRQAR WGGTDADE GEWPWQVS LHALGQGH I CGAS LIS PNWLVSAAHC Y I DDRGFR Y S 667 

+ R +R+VGG + E EWPWQVSLH G GH CGAS++S WL++AAHC + YS 
Sbjct: 193 PY-RSSRIVGGQVSQEAEWPWQVSLHIKGTGHTCGASVLSNRWLLTAAHCVRNPGSAMYS 251 

Query: 668 DPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSMV 727 

P QW LGLH+Q Q S V+ R +K+II H ++ T+D DIAL+EL+ + + 

Sbjct: 252 QPEQWEVLLGLHEQGQTSKWTVK-RSVKQIIPHHRYDPVTYDNDIALMELDANVTLNQNI 310 

Query: 728 RPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITPR 787 

PICLP ++ FP G W+TGWG T+ GG A +LQK +R+IN T C +L+ ++T 
Sbjct: 311 YPICLPSPTYYFPVGSEAWITGWGATREGGRPASVLQKAAVRIINSTVCRSLMSDEVTEG 370 

Query: 788 MMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFRD 847 

M+C G L GGVD+CQGDSGGPLS GR+F AGWSWGDGCA+RNKPGVYTR +R 

Sbjct: 371 MLCAGLLRGGVDACQGDSGGPLSFTSPSGRVFLAGWSWGDGCARRNKPGVYTRTTQYRS 430 

Query: 848 WIKE 851 
WI+E 

Sbjct: 431 WIRE 434 
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□ > gi I 57086055 jref |XP 546396.11 PREDICTED: similar to matriptase [Canis familiaris 
Length = 2045 

Score = 436 bits (1120), Expect = e-120 

Identities = 204/247 (82%), Positives = 224/247 (90%), Gaps = 3/247 (1%) 



Query: 


212 


NSCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRGDADSVLSLTFRSFDLASCDER 


271 






+SCSF LHAR ELMRFTTPGFPDSPYPA ARCQW LRGDAD VLSLTFRSFD+A+CD+R 




Sbjct : 


993 


DSCSFALHARSGELMRFTTPGFPDSPYPARARCQWTLRGDADFVLSLTFRSFDVATCDDR 


1052 


Query: 


272 


GSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLITNTERRHPGFEATF 


331 






GSDLV VY+TLSP+EP A+VQLCGTYPPSYNLTF SSQNVLL+TLITNTERRHPGFEATF 




Sbjct: 


1053 


GSDLVMVYDTLSPVEPRAWQLCGTYPPSYNLTFLSSQNVLLVTLITNTERRHPGFEATF 


1112 


Query: 


332 


FQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPG 


391 






FQLP++SSCGG LR +QGTF+S PY Y PGHY PPN++CTW+I E VP++Q+VKV FK FY+LEP 




Sbjct: 


1113 


FQLPKLSSCGGSLRGSQGTFSSPYYPGHYPPNMNCTWDIEVPSHQNVKVLFKAFYMLEPN 


1172 


Query: 


392 


VPAGTCPKDYVEINGEKYCGERSQFWTSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSD 


451 






P GTC KDYVE+NGEKYCGER QFWTS SNKITVRFHSDQSYTDTGFLAEYLSYDSSD 




Sbjct: 


1173 


TPLGTCSKDYVEVNGEKYCGERPQFWTSRSNKITVRFHSDQSYTDTGFLAEYLSYDSSD 


1232 


Query: 


452 


PCPG 455 








P PG 




Sbjct: 


1233 


REYPSPG 1239 





Score = 424 bits (1090), Expect = e-117 

Identities = 181/213 (84%), Positives = 196/213 (92%) 

Query: 453 CPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVN 512 

CPG+F C TGRCIR ELRCDGWADCTD+SDELNC C+A +QFTCKNKFCKPLFWVCDSVN 
Sbjct: 1448 CPGKFMCHTGRCIRNELRCDGWADCTDYSDELNCQCNATYQFTCKNKFCKPLFWVCDSVN 1507 

Query: 513 DCGDNSDEQGCSCPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASCPKVNVVTCTKHTY 572 

DCGDNSDEQ CSCPAQTFRC NGKCL ++QQC+G D+CGDGSDEA+C V V CTKHTY 
Sbjct: 1508 DCGDNSDEQECSCPAQTFRCGNGKCLPQNQQCDGTDNCGDGSDEATCDLVRTVACTKHTY 1567 

Query: 573 RCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGLRSFTRQARWGGTDADEGEWPWQVSL 632 

RC NGLCLSK NPECDGK+DCSDGSDEKDCDCGLRSFTRQ+RWGGT+ADEGEWPWQVSL 
Sbjct: 1568 RCHNGLCLSKSNPECDGKKDCSDGSDEKDCDCGLRSFTRQSRWGGTNADEGEWPWQVSL 1627 

Query: 633 HALGQGHI CGAS LIS PNWLVS AAHC Y I DDRGFR 665 

H LGQGH+CGAS+ISPNWLVSAAHC+I DDRGFR 
Sbjct: 1628 HVLGQGHVCGAS I IS PNWLVS AAHC F I DDRGFR 1660 



Score = 127 bits (319), Expect = le-27 

Identities = 66/99 (66%), Positives = 71/99 (71%), Gaps = 14/99 (14%) 

Query: 757 GTGALILQKGEIRVINQTTCENLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADG 816 

G+GAL+LQKGEIRVINQTT L GDSGGPLSSVEADG 
Sbjct: 1961 GS GALVLQKGE I RVI NQTTWR PALAADHAAH DGDSGGPLSSVEADG 2006 

Query: 817 RIFQAGWSWGDGCAQRNKPGVYTRLPLFRDWIKENTGV 855 

RIFQAGWSWGDGCAQR+KPGVYTRL +FRDWI+E TGV 
Sbjct: 2007 RIFQAGWSWGDGCAQRDKPGVYTRLAVFRDWIREETGV 2045 
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Score = 122 bits (307), Expect = 3e-26 
Identities = 56/96 (58%), Positives = 70/96 (72%) 

Query: 29 VNGLEEGVEFLPVNNVKKVEKHGPGRWXXXXXXXXXXXXXXXXXXXXXWHLQYRDVRVQK 88 

++G+EEGVEFLPVNN +KVEK GP RW WH QY+++RVQK 

Sbjct: 536 MSGVEEGVEFLPVNNTRKVEKRGPKRWLLVTGLAGLVLLSLVACLLMWH 595 

Query: 89 VFNGYMRITNENFVDAYENSNSTEFVSLASKVKDAL 124 

+FNGY+RITNENFVDAYENSNSTEF +LA++VK+A+ 
Sbjct: 596 I FNG Y LR I TNE N FVDAY E NS NS T E FANLANRVKE AV 631 



Score =70.5 bits (171), Expect = 2e-10 
Identities = 36/64 (56%), Positives = 42/64 (65%) 

Query: 136 PYHKESAVTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTS 195 

P H ++ + SEGSVIAYYWSEFSIPQ+L LPPRAR+L SFV+TS 

Sbjct: 792 PSHAPASRLSPSEGSVIAYYWSEFSIPQYLVEDAERVMAQERAAVLPPRARALNSFVLTS 851 

Query: 196 WAF 199 
WAF 

Sbjct: 852 WAF 855 



Score = 44.3 bits (103), Expect = 0.016 
Identities = 20/23 (86%), Positives = 21/23 (91%) 

Query: 124 LKLLYSGVPFLGPYHKESAVTAF 146 

LKLLYSGVP LGPYHK+S VTAF 
Sbjct: 694 LKLLYSGVPSLGPYHKKSMVTAF 716 



□ > qi| 56417706 | emb 1 CAI19335 . 1| OTTHUMP00000028798 [Homo sapiens] 
Length = 788 

Score = 398 bits (1022), Expect = e-109 

Identities = 251/786 (31%), Positives = 375/786 (47%), Gaps = 89/786 (11%) 

Query: 83 DVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESA 142 

+V V +V++G +R+ N +F S+ F S +K + LK L + LG Y+ S+ 

Sbjct: 72 EVMVSQVYSGSLRVLNRHFSQDLTRRESSAFRSETAKAQKMLKELITSTR-LGTYYNSSS 130 

Query: 143 VTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWAFPTD 202 

V +F EG + ++W IP+H L P +++ +V +++ 

Sbjct: 131 VYSFGEGPLTCFFWFILQIPEH RRLMLSPEV — VQALLVEELLSTVNS 176 

Query: 203 SKTVQ-RTQDNSCSFGLHARGVELMRFTTPGFP DS PY PAHARCQWALRGDADS VLS L 258 

SVR+ GL G + G P +CW L+G D +L L 

Sbjct: 177 SAAVPYRAEYEVDPEGLVILGCYRYSYVGQGQVLRLKGPDHLASSCLWHLQGPKDLMLKL 236 

Query: 259 TFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLIT 318 

+ LA C +R + +Y+ P+E + + G + +S ++ + 

Sbjct: 237 RLE-WTLAECRDR LAMYDVAGPLEKRLITSVYGCSRQEPWEVLASGAIMAWWKK 2 91 

Query: 319 NTERRHPGF EATFFQLPRMSSCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPN 374 
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+ F + FQ ++ +QG ++PY+P +Y P C+W++ VP+ 

Sbjct: 292 GLHSYYDPFVLSVQPWFQACEVNLTLDNRLDSQGVLSTPYFPSYYSPQTHCSWHLTVPS 351 



Query: 375 NQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCG ERSQFWTSNSNKIT 426 

+ + Y L C + I + CG ER V T+ IT 
Sbjct: 352 LDYGLALWFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPWATAG IT 408 

Query: 427 VRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNC 486 

+FSSTG YY+ SDPCPG+F C 
Sbjct: 409 INFTSQISLTGPGVRVHYGLYNQSDPCPGEFLCSV 443 

Query: 487 SCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCS-NGKCLSKSQQCN 545 

N CP CD V DC + DE+ CCA TF+C + C+S + C+ 
Sbjct: 444 NGLCVP ACDGVKDC PNGLDE RNCVCRA- T FQCKE DST C I S L PKVCD 488 

Query: 546 GKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCG 605 

G+ DC +GSDE C + V C T++C + C+ K NP+CDG+ DC DGSDE+ CDCG 
Sbjct: 489 GQPDCLNGSDEEQCQEG — VPCGTFTFQCEDRSCVKKPNPQCDGRPDCRDGSDEEHCDCG 546 

Query: 606 LRSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFR 665 

L+ +R+VGG + EGEWPWQ SL G+ HICG +LI + W+++AAHC+ +D 
Sbjct: 547 LQG — PSSRIVGGAVSSEGEWPWQASLQVRGR-HICGGALIADRWVITAAHCFQEDS 600 

Query: 666 YSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSS 725 

+ WT FLG Q+ R PG ++ R++ HP+ + + DYD+ALL+L+ P S+ 
Sbjct: 601 MASTVLWTVFLGKVWQNSRW-PGEVSFKVSRLLLHPYHEEDSHDYDVALLQLDHPWRSA 659 

Query: 726 MVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQIT 785 

VRP+CLP SH F G W+TGWG + GG + LQK ++++I Q C + Q+T 
Sbjct: 660 AVRPVCLPARSHFFEPGLHCWITGWGALREGGPISNALQKVDVQLIPQDLCSEVYRYQVT 719 

Query: 786 PRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLF 845 

PRM+C G+ G D+CQGDSGGPL GR F AG+VSWG GC + N GVYTR+ 

Sbjct: 720 PRMLCAGYRKGKKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFGVYTRITGV 779 

Query: 846 RDWIKE 851 
WI++ 

Sbjct: 780 ISWIQQ 785 



□ > qi | 34784418 | qb | AAH5767 4 . 1 1 13 Tmprss6 protein [Mus musculus] 
Length = 799 

Score = 397 bits (1020), Expect = e-109 

Identities = 247/785 (31%), Positives = 378/785 (48%), Gaps = 75/785 (9%) 

Query: 83 DVRVQKVFNGYMR I TNENFVDAYENSNSTE FVS LAS KVKDALKLLYSGVPFLGPYHKE SA 142 

+V V +V++G +R+ N +F S F S ++K + L+ L + LG Y+ S+ 

Sbjct : 71 EVTVSQVYSGSLRVLNRHFSQDLGRRESIAFRSESAKAQKMLQELVASTR-LGTYYNSSS 129 

Query: 143 VTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWAFPTD 202 

V +F EG + ++W IP++ L S S + + + D 

Sbjct: 130 VYSFGEGPLTCFFWFILDI PEY QRLTLSPEWRELLVDELLSNSSTLASYKTEYEVD 186 

Query: 203 SK — TVQRTQDNSCSFGLHARGVELMRFTTPG — FP-DSPYPAHARCQWALRGDADSVLS 257 

+ + N G +PGPP CW L+G D ++ 

Sbjct: 187 PEGLVILEASVNDIWLNSTLGCYRYSYVNPGQVLPLKGPDQQTTSCLWHLQGPEDLMIK 246 

Query: 258 LTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLI 317 
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+ + C +R V +Y+ P+E + + G + +S +V+ + 

Sbjct: 247 VRLE -WTRVDCRDR VAMYDAAGPLEKRLITSVYGCSRQEPVMEVLASGSVMAWWK 301 



Query: 318 TNTERRHPGF EATFFQLPRMS-SCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEV 372 

+ F ++ FQ +++ + GRL QG + PYYP +Y P+ C+W++ V 
Sbjct: 302 KGMHSYYDPFLLSVKSVAFQDCQVNLTLEGRL-DTQGFLRTPYYPSYYSPSTHCSWHLTV 360 

Query: 373 PNNQHVKVRFKFFYLLE PGVPAGTCPKDYVE INGEKYCGERS QFWTSNSNKITV 427 

P+ + + Y L C + I + CG R+ + + S+ +T+ 

Sbjct: 361 PSLDYGLALWFDAYALRRQKYNRLCTQGQWMIQNRRLCGFRTLQPYAERIPMVASDGVTI 42 0 

Query: 428 RFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCS 487 

F S S T G Y Y+ SDPCPG+F C 
Sbjct: 421 NFTSQISLTGPGVQVYYSLYNQSDPCPGEFLCSV 454 

Query: 488 CDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCS-NGKCLSKSQQCNG 546 

N C P CD + DC + DE+ CCA F+C + C+S + C+ 
Sbjct: 455 NGLCVP ACDGIKDCPNGLDERNCVCRAM-FQCQEDSTCISLPRVCDR 500 

Query: 547 KDDCGDGSDEASCPKVNVVTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGL 606 

+ DC +GSDE C + V C T++C + C+ K NPECDG+ DC DGSDE+ CDCGL 
Sbjct: 501 QPDCLNGSDEEQCQEG — VPCGTFTFQCEDRSCVKKPNPECDGQSDCRDGSDEQHCDCGL 558 

Query: 607 RSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRY 666 

+ + +R+VGGT + EGEWPWQ SL G+ HICG +LI+ W+++AAHC+ +D 
Sbjct: 559 QGLS — SRIVGGTVSSEGEWPWQASLQIRGR-HICGGALIADRWVITAAHCFQEDS M 612 

Query: 667 SDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSM 72 6 

+ P WT FLG Q+ R PG ++ R+ HP+ + + DYD+ALL+L+ P YS+ 
Sbjct: 613 ASPKLWTVFLGKMRQNSRW-PGEVSFKVSRLFLHPYHEEDSHDYDVALLQLDHPWYSAT 671 

Query: 727 VRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITP 786 

VRP+CLP SH F G+ W+TGWG + GG + LQK +++++ Q C Q++P 
Sbjct: 672 VRPVCL PARS HFFEPGQHCWITGWGAQREGGPVSNTLQKVDVQLVPQDLCSE AYR YQVSP 731 

Query: 787 RMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFR 846 

RM+C G+ G D+CQGDSGGPL E GR F AG+VSWG GC + N GVYTR+ 
Sbjct: 732 RMLCAGYRKGKKDACQGDSGGPLVCREPSGRWFLAGLVSWGLGCGRPNFFGVYTRVTRVI 791 

Query: 847 DWIKE 851 
+WI++ 

Sbjct: 792 NWIQQ 796 



□ > qi | 31980537 | qb | AAP69827 . 1 1 EH matriptase-2 [Mus musculus] 

gi | 37515264 | gb | AAH29645 . 2 | B Type II transmembrane serine protease 6 [Mus musculus 
gi | 37089748 | sp | Q9DBI0 1 TMS6 MOUSE Transmembrane protease, serine 6 (Matriptase-2) 
Length = 811 

Score = 397 bits (1020), Expect = e-109 

Identities = 247/785 (31%), Positives = 378/785 (48%), Gaps = 75/785 (9%) 

Query: 83 DVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESA 142 

+V V +V++G +R+ N +F S F S ++K + L+ L + LG Y+ S+ 

Sbjct : 83 EVTVSQVYSGSLRVLNRHFSQDLGRRESIAFRSESAKAQKMLQELVASTR-LGTYYNSSS 141 

Query: 143 VTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWAFPTD 202 
V +F EG + ++W IP++ L S S + + + D 
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Sbjct: 142 VYSFGEGPLTCFFWFILDIPEY QRLTLSPEWRELLVDELLSNSSTLASYKTEYEVD 198 



Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 

Query: 

Sbjct: 



203 S K — TVQRTQDNS CS FGLHARGVE LMR FTT PG — F P- DS PY PAHAR CQWALRGDADS VLS 257 

+ + N G +PGPP CW L+G D ++ 

199 PEGLVILEASVNDIWLNSTLGCYRYSYVNPGQVLPLKGPDQQTTSCLWHLQGPEDLMIK 258 

258 LTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLI 317 
+ + C +R V +Y+ P+E + + G + +S +V+ + 

259 VRLE-WTRVDCRDR VAMYDAAGPLEKRLITSVYGCSRQE PVMEVLASGSVMAWWK 313 

318 TNTERRHPGF EATFFQLPRMS-SCGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEV 372 

+ F ++ FQ +++ + GRL QG +PYYP +Y P+ C+W++ V 
314 KGMHSYYDPFLLSVKSVAFQDCQVNLTLEGRL-DTQGFLRTPYYPSYYSPSTHCSWHLTV 372 

373 PNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYCGERS — QFWTSNSNKITV 427 

p+ + + Y L C + I + CG R+ + + S+ +T+ 

373 PSLDYGLALWFDAYALRRQKYNRLCTQGQWMIQNRRLCGFRTLQPYAERIPMVASDGVTI 432 



42 8 RFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCS 487 

F S S T G Y Y + SDPCPG+F C 
433 NFTSQISLTGPGVQVYYSLYNQSDPCPGEFLCSV 



466 



488 CDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCS-NGKCLSKSQQCNG 546 

N CP CD + DC + DE+ CCA F+C + C+S + C+ 

467 NGLCVP ACDGIKDCPNGLDERNCVCRAM-FQCQEDSTCISLPRVCDR 512 

547 KDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGL 606 

+ DC +GSDE C + V C T++C + C+ K NPECDG+ DC DGSDE+ CDCGL 
513 QPDCLNGSDEEQCQEG — VPCGTFTFQCEDRSCVKKPNPECDGQSDCRDGSDEQHCDCGL 570 

607 RSFTRQARWGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYIDDRGFRY 666 

+ + +R+VGGT + EGEWPWQ SL G+ HICG +LI+ W+++AAHC+ +D 
571 QGLS — SRIVGGTVSSEGEWPWQASLQIRGR-HICGGALIADRWVITAAHCFQEDS M 624 

667 SDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSM 72 6 

+ P WT FLG Q+ R PG ++ R+ HP+ + + DYD+ALL+L+ P YS+ 
625 ASPKLWTVFLGKMRQNSRW-PGEVSFKVSRLFLHPYHEEDSHDYDVALLQLDHPWYSAT 683 

727 VRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQITP 786 

VRP+CLP SH F G+ W+TGWG + GG + LQK +++++ Q C Q++P 
684 VRPVCLPARSHFFEPGQHCWITGWGAQREGGPVSNTLQKVDVQLVPQDLCSEAYRYQVSP 743 

787 RMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLFR 846 

RM+C G+ G D+CQGDSGGPL E GR F AG+VSWG GC + N GVYTR+ 
744 RMLCAGYRKGKKDACQGDSGGPLVCREPSGRWFLAGLVSWGLGCGRPNFFGVYTRVTRVI 803 



Query: 847 DWIKE 851 

• +WI++ 
Sbjct: 804 NWIQQ 808 



□> qi|27229129|ref INP 082178.11 13 type II transmembrane serine protease 6 [Mus mus 
qi | 26338412 1 dbj I BAB23684.2 | B unnamed protein product [Mus musculus] 
Length =812 

Score = 395 bits (1014), Expect = e-108 

Identities = 248/786 (31%), Positives = 379/786 (48%), Gaps = 76/786 (9%) 
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Query: 83 DVRVQKVFNGYMRITNENFVDAYENSNSTEFVSL^KVKDALKLLYSGVPFLGPYHKESA 142 

+V V +V++G +R+ N +F S F S ++K + L+ L + LG Y+ S+ 

Sbjct : 83 EVTVSQVYSGSLRVLNRHFSQDLGRRESIAFRSESAKAQKMLQELVASTR-LGTYYNSSS 141 

Query: 143 VTAFSEGSVIAYYWSEFSIPQHLXXXXXXXXXXXXXXXLPPRARSLKSFWTSWAFPTD 202 

V +F EG + ++W IP++ L S S + + + D 

Sbjct: 142 VYSFGEGPLTCFFWFILDIPEY QRLTLSPEWRELLVDELLSNSSTLASYKTEYEVD 198 

Query: 203 SK — TVQRTQDNSCSFGLHARGVELMRFTTPG — FP-DSPYPAHARCQWALRGDADSVLS 257 

+ + N G +PGPP CW L+G D ++ 

Sbjct: 199 PEGLVILEASVNDIWLNSTLGCYRYSYVNPGQVLPLKGPDQQTTSCLWHLQGPEDLMIK 258 

Query: 258 LTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQNVLLITLI 317 

+ + C +R V +Y+ P+E + + G + +S +V+ + 

Sbjct: 259 VRLE -WTRVDCRDR VAMYDAAGPLE KR L I TS VY GCS RQE PVME VLAS GS VMAWWK 313 

Query: 318 TNTERRHPGF EAT F FQL PRMS - S CGGR LRKAQGT FNS P Y Y PGH Y PPN I DCTWNI E V 372 

+ F ++ FQ +++ + GRL QG +PYYP +Y P+ C+W++ V 
Sbjct: 314 KGMHSYYDPFLLSVKSVAFQDCQVNLTLEGRL-DTQGFLRTPYYPSYYSPSTHCSWHLTV 372 

Query: 373 PNNQHVKVR FKF F Y LLE PGVPAGT C PKD Y VE I NGE KY CGE RS QFWTSNSNKITV 427 

P+ + + Y L C + I + CG R+ + + S+ +T+ 

Sbjct: 373 PSLDYGLALWFDAYALRRQKYNRLCTQGQWMIQNRRLCGFRTLQPYAERIPMVASDGVTI 432 

Query: 428 RFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCTDHSDELNCS 487 

F S S T G Y Y+ SDPCPG+F C 
Sbjct: 433 NFTSQISLTGPGVQVYYSLYNQSDPCPGEFLCSV 466 

Query: 488 CDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCS-NGKCLSKSQQCNG 546 

N CP' CD + DC + DE+ CCA F+C + C+S + C+ 
Sbjct: 467 NGLCVP ACDGIKDCPNGLDERNCVCRAM- FQCQEDSTCI SLPRVCDR 512 

Query: 547 KDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGSDEKDCDCGL 606 

+ DC +GSDE C + V C T++C + C+ K NPECDG+ DC DGSDE+ CDCGL 
Sbjct: 513 QPDCLNGSDEEQCQEG — VPCGTFTFQCEDRSCVKKPNPECDGQSDCRDGSDEQHCDCGL 570 

Query: 607 RS FTRQARWGGTDADEGEWPWQVSLHALGQGHI CGASLI S PNWLVSAAHCY IDDRGFRY 666 

+ + +R+VGGT + EGEWPWQ SL G+ HICG +LI+ W+++AAHC+ +D 
Sbjct: 571 QGLS — SRIVGGTVSSEGEWPWQASLQIRGR-HICGGALIADRWVITAAHCFQEDS M 624 

Query: 667 SDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLELEKPAEYSSM 726 

+ P WT FLG Q+ R PG ++ R+ HP+ + + DYD+ALL+L+ P YS + 
Sbjct: 625 ASPKLWTVFLGKMRQNSRW-PGEVSFKVSRLFLHPYHEEDSHDYDVALLQLDHPWYSAT 683 

Query: 727 VRPICLPDA-SHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCENLLPQQIT 7 85 

VRP+CLP A SH F G+ W+TGWG + GG + LQK +++++ Q C Q++ 
Sbjct: 684 VRPVCLPPARSHFFEPGQHCWITGWGAQREGGPVSNTLQKVDVQLVPQDLCSEAYRYQVS 743 

Query: 786 PRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPGVYTRLPLF 845 

PRM+C G+ G D+CQGDSGGPL E GR F AG+VSWG GC + N GVYTR+ 
Sbjct: 744 PRMLCAGYRKGKKDACQGDSGGPLVCREPSGRWFLAGLVSWGLGCGRPNFFGVYTRVTRV 803 

Query: 846 RDWIKE 851 
+WI++ 

Sbjct: 804 INWIQQ 809 



□ > qi I 23957702 | ref | NP 705837.11 



transmembrane protease, serine 6 [Homo sapiens] 
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qi I 23428417 I qb | AAL16414 . 1 1 El type II transmembrane serine protease 6 [Homo sapiens 

qi 1234284091 qb | AAL16413 . 1 | type II transmembrane serine protease 6 [Homo sapiens 

qi 1 37088915 | sp | Q8IU80 | TMS6 HUMAN S Transmembrane protease, serine 6 (Matriptase-2 
Length =811 

Score = 393 bits (1009), Expect = e-107 

Identities = 249/794 (31%), Positives = 370/794 (46%), Gaps = 91/794 (11%) 



Query: 83 DVRVQKVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPFLGPYHKESA 142 

+V V +V++G +R+ N +F S+ F S +K + LK L + LG Y+ S+ 

Sbjct: 81 EVMVSQVYSGSLRVLNRHFSQDLTRRESSAFRSETAKAQKMLKELITSTR-LGTYYNSSS 139 

Query: 143 VTAFSEGSVIAYYWSEFSIPQHLXXXXX XXXXXXXXXXLPPRARSLKSF 191 

V +F EG + ++W IP+H +P RA 

Sbjct: 140 VYSFGEGPLTCFFWFILQIPEHRRLMLSPEWQALLVEELLSTVNSSAAVPYRAEY — EV 197 

Query: 192 WTSWAFPTDSKTVQRTQDN-SCSFGLHARGVELMRFTTPGFPDSPYPAHARCQWALRG 250 

+V K + C + +++R PS C W L+G 

Sbjct: 198 DPEGLVILEASVKDIAALNSTLGCYRYSYVGQGQVLRLKGPDHLASS CLWHLQG 251 

Query: 251 DADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHALVQLCGTYPPSYNLTFHSSQN 310 

D +L L + LA C +R + +Y+ P+E + + G + +S 

Sbjct: 252 PKDLMLKLRLE-WTLAECRDR LAMYDVAGPLEKRLITSVYGCSRQEPWEVLASGA 306 

Query: 311 VLLITLITNTERRHPGF EATFFQLPRMSSCGGRLRKAQGTFNS PY Y PGHYPPNI DC 366 

++ + + F + FQ ++ +QG ++PY+P +Y P C 

Sbjct: 307 IMAWWKKGLHSYYDPFVLSVQPWFQACEVNLTLDNRLDSQGVLSTPYFPSYYSPQTHC 366 

Query: 367 TWNI E V PNNQHVKVR FKF F Y LLE PGVPAGT C PKDY VE I NGE KY CG ERSQFW 418 

+W++ VP+ + + YL C + I + CG ER V 

Sbjct: 367 SWHLTVPSLDYGLALWFDAYALRRQKYDLPCTQGQWTIQNRRLCGLRILQPYAERIPWA 426 

Query: 419 TSNSNKITVRFHSDQSYTDTGFLAEYLSYDSSDPCPGQFTCRTGRCIRKELRCDGWADCT 47 8 

T+ IT+ F S S T G Y Y+ SDPCPG+F C 
Sbjct: 427 TAG ITINFTSQISLTGPGVRVHYGLYNQSDPCPGEFLCSV 466 

Query: 47 9 DHSDELNCSCDAGHQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGCSCPAQTFRCS-NGKC 537 

N C P CD V DC + DE+ CCA TF+C + C 
Sbjct: 467 NGLCVP ACDGVKDCPNGLDERNCVCRA-TFQCKEDSTC 503 

Query: 538 LSKSQQCNGKDDCGDGSDEASCPKVNWTCTKHTYRCLNGLCLSKGNPECDGKEDCSDGS 597 

+S + C+G+ DC +GSDE C + V C T++C + C+ K NP+CDG+ DC DGS 
Sbjct: 504 ISLPKVCDGQPDCLNGSDEEQCQEG — VPCGT FT FQCEDRS CVKKPNPQCDGR PDCRDGS 561 

Query: 598 DEKDCDCGLRS FTRQARWGGTDADEGEWPWQVS LHALGQGHI C GAS LI S PNWLVSAAHC 657 

DE+ CDCGL+ +R+VGG + EGEWPWQ SL G+ HICG +LI+ W+++AAHC 

Sbjct: 562 DEEHCDCGLQG — P S S R I VGGAVS S E GE W PWQAS LQVRGR - H I C GGAL I ADRWV I T AAH C 618 

Query: 658 YIDDRGFRYSDPTQWTAFLGLHDQSQRSAPGVQERRLKRIISHPFFNDFTFDYDIALLEL 717 

+ +D + WT FLG Q+ R PG ++ R++ HP+ + + DYD+ALL+L 

Sbjct: 619 FQEDS MASTVLWTVFLGKVWQNSRW- PGEVS FKVSRLLLHPYHEEDSHDYDVALLQL 674 

Query: 718 EKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQTTCE 777 

+ P S+ VRP+CLP SH F G W+TGWG + GG + LQK ++++I Q C 
Sbjct: 675 DHPWRSAAVRPVCLPARSHFFEPGLHCWITGWGALREGGPISNALQKVDVQLIPQDLCS 734 

Query: 778 NLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSSVEADGRIFQAGWSWGDGCAQRNKPG 837 

+ Q+TPRM+C G+ G D+CQGDSGGPL GR F AG+VSWG GC + N G 

Sbjct: 735 EVYRYQVTPRMLCAGYRKGKKDACQGDSGGPLVCKALSGRWFLAGLVSWGLGCGRPNYFG 794 
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